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(57) Abstract 

Proteins having a telomerasc activity and being useful in screening telomerase inhibitors, developing diagnostic methods with the use 
of antibodies, etc. Conjugated proteins including proteins having a telomerase activity which have the following physicochemical properties: 
function and substrate specificity: catalyzing the elongation of the telomere DNA 3'-OH terminal of chromosomes of eukaryotes; molecular 
weight: ranging from about 40 to about 120 kD, in particular, about 110, 58 and/or 45kO in some cases, as determined by SDS-PAGE; 
inactivation: inactivated by treating with RNasc; and characteristic: binding to mouse telomerase RNA. 
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Specification 
Proteins Having Telomerase Activity 

Technical Field 

This invention is related to proteins having telomerase activity. In more detail, this invention is 
related to proteins having telomerase activity, which were purified from rat-derived tissue 
cultured cells. 

Technical Background 

Telomerase is known to catalyze elongation of the telomere end (terminal portion of linear 
chromosome) and many investigations have been conducted on this subject (Greider & 
Blackburn, Cell 51, 887-898, 1987; Morine, Cell 59, 521-529, 1989). In eukaryotes, telomerase 
is known as a type of reverse transcriptase which contains complementary template RNA to the 
telomere DNA sequence, 5 5 TTAGGG3 ' , and elongates the single chain of telomere DNA 
based on the template RNA. At 3 ' end of linear DNA in eukaryotes, single chain of 5 ' 
(TTAGGG) n 3 ' is protruding and thus this structure functions as primer for the telomerase 
reaction. 

While telomerase activity can not be detected in normal cells except some cells such as 
hemopoietic stem cells, its strong activity can be detected in most tumor cells. As such, 
telomerase is though to be involved with maintenance of infinite proliferation of tumor cells. 
Since telomerase is selectively detected in tumor cells as mentioned above, it is noteworthy as a 
target for antitumor agents. 

Although enzyme proteins for telomerase have not been purified for a long period of time 
throughout all organisms, telomerase from tetrahymena has been recently purified and its cDNA 
has also been cloned (Collins et al, Cell 81, 677-786, 1995). 

However, there has been no report of purified protein with telomerase activity from mammals 
such as rat and human. 

Disclosure of Invention 

One objective of this invention is to provide new proteins having telomerase activity, in 

particular proteins having telomerase activity derived from mammals. 

Another objective of this invention is to establish a procedure to purify proteins having 

telomerase activity utilizing TRAP-SPA method, one of the methods to measure telomerase 

activity. 

To achieve the above objectives, this inventor studied conditions in detail to purify proteins 
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having telomerase activity from rat hepatocytes as starting materials. As a result, this inventor 
successfully obtained a fraction containing relatively high telomerase activity by combining 
ammonium sulfate precipitation and purification treatments by certain column chromatographies 
and completed this invention. 

That is, this invention provide conjugated proteins including proteins having telomerase activity 
with the following physicochemical properties: 

function and substrate specificity: catalyzing elongation of telomere DNA 3 ' -OH terminal of 
chromosomes of eukaryotes; 

molecular weight: ranging approximately 40-120 kD as determined by SDS-PAGE; 
inactivation: inactivated by treating with RNase; 
characteristic: binding to mouse telomerase RNA. 

The molecular weights of above proteins having telomerase activity are approximately 1 10, 58 
and/or 45kD by SDS-PAGE. 

Furthermore, in another view, this invention provides proteins having telomerase activity as 
characterized by the following physicochemical properties: 

function and substrate specificity: catalyzing elongation of telomere DNA 3 ' -OH terminal of 
chromosomes of eukaryotes; 

molecular weight: approximately 1 10 kD, approximately 58 kD or approximately 45 kD as 

determined by SDS-PAGE; 

inactivation: inactivated by treating with RNase; 

characteristic: binding to mouse telomerase RNA. 

The proteins having telomerase activity in this invention can be obtained by purifying them, 
preferably from rat hepatocytes. 

In addition, the proteins having telomerase activity can be obtained preferably through 
ammonium sulfate precipitation of cell extract, a gel-filtration chromatography, a cation- 
exchange chromatography and a subsequent gel-filtration chromatography. 
Inactivation by RNase treatment can be raised as one of characteristics of these proteins having 
telomerase activity in this invention. 

This is because telomerase is riboprotein (complex between protein and RNA) requiring a RNA 
subunit to express its activity (functions as a template when elongating the telomere sequence). 
That is, when telomerase is treated by RNase, its RNA subunit is destroyed. Telomerase then 
loses the template for elongation of telomere sequence and is inactivated. 

Concise Explanation of Figures 

Figure 1 depicts 3 H-incorporation of fractions obtained by gel-filtration chromatography using a 
Sephacryl S-300 column. 
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Figure 2 depicts H-incorporation of fractions obtained by cation-exchange chromatography 
using HyLoad SP column. 

Figure 3 depicts 3 H-incorporation of fractions obtained by gel-filtration chromatography using a 
Sephacryl S-400 column 

The Best Form to Practice the Invention 

We will list examples for preparation method of proteins having telomerase activity below. 
It is known that telomerase activity can be detected in reproductive organs and tumor cells in 
human. There is also a report that hepatocytes from rat show high telomerase activities at 
stationary and growth phases. Accordingly, proteins having telomerase activity in this invention 
can be prepared from materials (cultured cells and tissues) containing telomerase activity. For 
example, in an example described later, a preparation was carried out using rat AH7974 cells. 
Preparation methods for proteins having telomerase activity from materials containing telomerase 
activity in this invention are not limited to specific methods, but generally include proper 
combination of a salting out with inorganics (for example, ammonium sulfate, alkali metal sulfate 
salts, alkali metal halides), precipitation with organic solvents using hydrophilic organic solvents 
etc. (for example, alcohols such as ethanol or isopropyl alcohol, ketones such as acetone), 
adsorption and release methods using ion-exchange resins and a variety of column 
chromatographies, gel filtration, ultrafiltration and protein precipitating agents (for example, 
nucleic acids, tannin etc.). 

Moreover, proteins having telomerase activity thus obtained can be further purified by isoelectric 
precipitation, dialysis, electric dialysis and electrophoresis. 

For example, extract of rat hepatocytes is salted out with 40-60% ammonium sulfate. The 
precipitate is dissolved in buffer, filtered and fractionated by gel filtration (Sephacryl S-300 
column). The active fraction is fractionated by cation-exchange chromatography (HyLoad SP 
column etc.). The active fraction is salted out with 60% ammonium sulfate. The precipitate is 
dissolved into buffer, filtered and further purified by gel filtration chromatography (Sephacryl S- 
400 column etc.). After collecting the active fraction, 8% SDS-PAGE analysis demonstrates three 
bands corresponding approximately to 1 10, 58 and 45 kD. 

In addition, proteins having telomerase activity in this invention is characterized by its binding to 
mouse telomerase RNA. This binding can be measured by adding labeled mouse telomerase 
RNA to gel containing proteins having telomerase activity and performing autoradiography after 
a specified period of time as described in the following example. 

There have been several reports regarding an assay method for telomerase activity, which is a key 
procedure for purification of telomerase. For example, with regards to telomerase activity 
detection for tetrahymena, there has been a report on telomerase detection dealing with 
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elongation of single primer. However, this has had problems with sensitivity, time required for 
detection, quantitativeness and amount of samples it can handle. To solve these problems, a 
method called TRAP (telomeric repeat amplification protocol), which is based on polymerase 
chain reaction, has been developed (Kim et aL, Science 206, 201 1-2015, 1994; Piatyszek et al, 
Meth Cell Sci 17, 1-15, 1995), and sensitivity and time required for detection have been 
improved. 

However, the TRAP method still requires analysis of 32 P-labeled reaction products or 
fluorescence-labeled reaction products by polyacrylamide gel-electrophoresis or HPLC. This 
creates issues related to the sample size, handling of 32 P, time requirement for the procedures and 
delay required for quantitative determination of band visibility. 

To solve the above problems, a method called TRAP-SPA, as a rapid and sensitive method to 
detect and quantitate telomerase activity, has been recently reported (Application Number 1996, 
17830). 

This invention provides new proteins having telomerase activity, does not limit telomerase assay 
methods, and any methods mentioned above can be used. In the following examples, the TRAP- 
SPA method, which is the most desirable as it is rapid and sensitive, is utilized. 

Examples 

Example 1 : Preparation of CHAPS Extract 

AH7974 cells (volume: 40-50 ml, equivalent to those obtained from six rats) were rinsed twice 
by suspending them in chilled phosphate buffer and centrifuging them. These cells were then 
rinsed with rinsing buffer (10 mM Hepes pH 7.5; 1.5 mM MgCl 2 ; 10 mM KC1; 1 mM DTT), 
centrifuged and suspended into 10 ml cell extract buffer (10 mM Tris-HCl pH 7.5; 1 mM MgCb; 
1 mM EGTA; 1 mM PMSF; 5 mM 2-mercaptoethanol; 0.5% CHAPS; 10% glycerol). The 
suspension was kept on ice for 30 min and centrifuged at 15,000 rpm for 30 min to obtain the 
supernatant. This fraction had 30 mg protein/ml x 30 ml. This cell extract was used for the 
following purification. 

Example 2: Ammonium Sulfate Precipitation 

To the cell extract obtained at Example 1, 1 M Tris-HCl pH 8.3 was added at the ratio of 1/30, 
and 40-60% ammonium sulfate fraction was collected. This fraction was dissolved into 5-10 ml 
buffer D containing 20% glycerol (20 mM Tris-HCl pH 7.5; 1 mM EDTA; 1 mM sodium 
bisulfate; 0.01% NP-40; 10%? glycerol; 1 mM benzamidine) and filtered (0.45 jim). The solution 
thus obtained had 42.78 mg protein/ml x 7 ml. 

Example 3: Sephacryl S-300 Gel Filtration Column Chromatography 
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The ammonium sulfate fraction obtained in Example 2 was fractionated by a 45 mm x 50 cm 
Sephacryl S-300 (Pharmacia) column, which was equilibrated with buffer D (described in 
Example 2). The fractions obtained were analyzed by TRAP-SPA method as follows: to detect 
3 H incorporation which is specific to telomerase activity, the TRAP-SPA method was performed 
on both RNase (RNase +) treated and untreated (RNase -) samples from each fraction. The 
RNase treatment was carried out by adding 1 jil RNase (10 mg/ml) to a 20 |il sample and 
incubating it for 10 min at 30°C. 
(TRAP-SPA method) 

TS primer (sequence number 1) and CX primer (sequence number 2) were synthesized using a 
DNA synthesizer (ABI). Biotinyl-CX and -TS primers were synthesized by coupling 
biotinLCbiotin-ON™phosphoramidite?(Clonetech) to 5 ' terminal of oligonucleotides. Primers 
were purified using CPC? column (ABI) (based on their instruction), lyophilized, treated with 
DEPC (diethylpyrocarbonate?) and suspended into water. 

BiotinylCX primers (0.01-0.1 |ig) (Biot-CX) were trapped in the presence of wax? in a Hot-Start 
tube (GIBCO-BRL). 

The eluate fractions (20 jo.1) (RNase treated or untreated) were incubated with 20 mM Tris-HCl 
pH 8.3, 1.5 mM MgCl 2 , 63 mM KC1, 0.005% Tween 20, 1 mM EGTA, 50 ^iM dATP and dCTP, 
2 [iM dTTP, 50 |iM dGTP, 2 ^Ci [Me- 3 H]TTP (Amersham, 1 14 Ci/mmol),'0. 1 |ig BSA/ml, 2 U 
Taq polymerase and 0.1 jag or appropriate amount of TS primer in the final mixture volume of 50 
fil on wax barrier?at room temperature for 30 min. 

To amplify the synthesized telomere oligonucleotides, the mixture was heated at 90°C for 90 sec, 
at 94°C for 30 sec, at 50°C for 30 sec and 72°C for 45 sec for one cycle. PCR was carried out for 
31 cycles. 

Reaction products (40 |il) were transferred to 96 wellplate (Wallac), 50 jil fluoromicrosphere (in 

0.56 M EDTA, 1 :4 solution) precoated with streptoavidin was added and incubated for 10 min at 

37°C to let biotinylated 3 H-labeled reaction product to bind to streptoavidin beads. 

The plate was counted by MicroBeta Scintillation Counter (Wallac). 

The relationship between each fraction and 3 H-incorporation is depicted in Figure 1 . 

The fractions containing telomerase activity (elution volume was 315-375 ml: fraction numbers 

in Figure 1 were 21-25) were collected. This fraction showed 1.3 mg protein/ml x 75 ml. 

Example 4: Cation Exchange Chromatography by HyLoad SP Column 
The active gel filtration fraction (75 ml) obtained in Example 3 was applied to HiLoad? SP 
column (Pharmacia: 16 mm x 10 cm) which was equilibrated with buffer D and a KC1 gradient 
(0.0-1.0 M) was instituted. Each fraction was analyzed by TRAP-SPA method as described in 
Example 3. Figure 2 depicts 3 H-incorporation of each fraction. 
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The fraction with telomerase activity (27.5 ml, fraction numbers 43-47 in Figure 2), which was 
eluted with 0.1-0.3 M KG, was collected. 

Proteins in this fraction were precipitated with ammonium sulfate (60% saturation) and dissolved 
into 4 ml buffer D containing 20% glycerol. The protein concentration at this point was 1.0 
mg/ml. This sample was filtered (0.45 (i). 

Example 5: Gel Filtration Chromatography by Sephacryl S-400 

The material purified by a HiLoad? column in Example 4 was further fractionated by applying it 
to a 15 mm x 75 cm Sephacryl S-400 column (Pharmacia), which was equilibrated with buffer D 
with 50 mM KG. Each fraction was analyzed by the TRAP-SPA method as described in 
Example 3. Figure 3 depicts the relationship between each fraction and 3 H-incorpo ration. 
The fraction containing telomerase activity (fraction number 22 in Figure 3: 3 ml) was collected. 
The protein concentration of this fraction was 0.289 mg/ml. 

Example 6: Analysis by SDS-PAGE 

The fraction containing telomerase activity in Example 5 (fraction number 22 in Figure 3) was 
analyzed by SDS-PAGE using 8-10% gel concentration. 

As a result, multiple bands were detected between approximately 40 kD and approximately 120 
kD, particularly detecting strong bands at approximately 1 10, 58 and 45 kD. 

Example 7:Binding of Telomerase RNA to a Subunit of Proteins Having Telomerase Activity 

(1) Cloning of Mouse Telomerase RNA 

Mouse telomerase RNA gene was amplified by RT-PCR method using mouse liver total RNA 
(Clonetech, catalog #64042-1) as a source, and subcloning was performed at Smal site of pGEM- 
3Zf (-) vector (pGEM3Zf/mTR). The utilized 5 ' and 3 ' primers are as follows: 
5 9 -GGG GTA TTT AAG GTC GAG GGC GGC-3 ' 
5 ' -TTG TGA GAA CCG AGT TCC GGG TGC-3 ' 
(sequence numbers 3 and 4). 

(2) Preparation of Mouse Telomerase RNA 

32 P-labeled telomerase RNA was prepared according to instruction using pGEM3Zf/mTR (as 
sense chain is synthesized by T7RNApolymerase, the vector in which telomerase RNA gene is 
inserted) which was cleaved by BamHl as template and Riboprobe Combination System-Sp6/T7 
(Promega) (synthsized telomerase RNA include GGG CGA AUU CGA GCU CGG UAG CCG 
and AGG GG A UC at the 5 ' and 3 ' terminals, respectively, which were derived from vector 
sequence. 

(3) Binding of Telomerase RNA to Bands Separated by SDS-PAGE Was Analyzed Based on the 
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Method of Collins et al (Cell 81, 677-686). 

The fraction with high telomerase activity (fraction number 22 in Figure 3) was fractionated by 
SDS-PAGE as described in Example 6. To achieve recoiling of proteins, SDS gel was rinsed with 
50% methanol solution for 15 min and with 10% ethanol for 4 h with shaking. In addition, after 
the gel was equilibrated with 50 mM Tris-acetate pH 8.0, 10 mM MgCh, 10% glycerol, P- 
labeled telomerase RNA was added with Yeast RNA (Sigma). After overnight incubation, the gel 
was rinsed , and autoradiography was carried out. 

As a result, multiple bands were detected between approximately 40 kD and approximately 120 
kD, particularly detecting strong bands at approximately 1 10, 58 and 45 kD. 

Potential Industrial Field of Application 

Proteins having telomerase activity in this invention are new proteins. 

Proteins having telomerase activity in this invention are useful in screening telomerase inhibitors 
and developing diagnostic methods with the use of antibodies. 

Sequence Table 

Sequence Number: 1 
Sequence Length: 18 
Sequence Pattern: nucleic acid 
Number of Chains: single chain 
Topology: linear 

Type of Sequence: other nucleic acid (synthetic DNA) 
Characteristics of Sequence: 
Other information: TS primer 
Sequence: 

AATCCGTCGA GCAGAGTT 

Sequence Number: 2 
Sequence Length: 24 
Sequence Pattern: nucleic acid 
Number of Chains: single chain 
Topology: linear 

Type of Sequence: other nucleic acid (synthetic DNA) 
Characteristics of Sequence: 
Other information: CX primer 
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Sequence: 

CCCTTACCCT TACCCTTACC CTAA 

Sequence Number: 3 
Sequence Length: 24 
Sequence Pattern: nucleic acid 
Number of Chains: single chain 
Topology: linear 

Type of Sequence: other nucleic acid (synthetic DNA) 
Sequence: 

GGGGTATTTA AGGTCGAGGG CGGC 

Sequence Number: 4 
Sequence Length: 24 
Sequence Pattern: nucleic acid 
Number of Chains: single chain 
Topology: linear 

Type of Sequence: other nucleic acid (synthetic DNA) 
Sequence: 

TTGTGAGAAC CGAGTTCCGG GTGC 

Scope of Claims 

1 . Conjugated proteins which include proteins having telomerase activity with following 
physicochemical properties: 

function and substrate specificity: catalyzing elongation of telomere DNA 3 ' -OH terminal of 
chromosomes of eukaryotes; 

molecular weight: ranging approximately 40-120 kD as determined by SOS-PAGE; 
inactivation: inactivated by treating with RNase; 
characteristic: binding to mouse telomerase RNA. 

2. Conjugated proteins within the first item of scope of claims with the characteristics that the 
molecular weights determined by SDS-PAGE of proteins having telomerase activity are 
approximately 1 10, 58 and 45 kD, 

3. Proteins having telomerase activity with characteristics of the following properties: 
function and substrate specificity: catalyzing elongation of telomere DNA 3 5 -OH terminal of 
chromosomes of eukaryotes; 
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molecular weight: approximately 1 10, 58 and 45 kD determined by SDS-PAGE; 
inactivation: inactivated by treating with RNase; 
characteristic: binding to mouse telomerase RNA. 

4. Proteins described in any of claims 1-3 with characteristics of being derived from rat 
hepatocytes. 

5. Proteins described in any of the claims 1-4 with characteristics that these proteins can be 
purified from cell extract by ammonium sulfate precipitation, gel filtration chromatography, 
cation-exchange chromatography, and gel filtration chromatography. 
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(57) Abstract 

Proteins having a telomeres* activity and being useful in screening telomcrase inhibitors, developing diagnostic methods with the use 
of antibodies, etc. Conjugated proteins including proteins having a teiomerase activity which have the following physicochcmicaJ properties: 
function and substrate specificity, catalyzing the elongation of the telomere DNA 3'-OH terminal of chromosomes of eukaryotes; molecular 
weight: ranging from about 40 to about 120 kD, in particular, about 1 10, 58 and/or 45kD in some cases, as determined by SDS-PAGE; 
inactivalion: inactivated by treating with RNase; and characteristic: binding to mouse tclomerasc RNA. 
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25 £flA&t * ; 

5**2 : SDS-P AG ETilJSLT, ftl 1 0 kD, ,#j5 8 kD*fc{«S|4 5 
kD ; 

TfSmt : RNa s e jflHICj: 0 7Sttft$n* ; 
: "V7X7^d> 7-t'RNA ; 
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CilC J: Off* c 

*^fay 5~tfffitt*tf W<^K©ttR<0— -ai LTs RN a s 
e iasir J; (9 7Stt<t <* ft S C 4 *<ajttf ^ n 5 o 

4 LT<Z>&fij£So) ^a-C*SwilIfiH-f*„ fn^7-l:li, 
RNa s eT&l$n5i-eoRNAf7a^7 F^^n^CtCiDrn 

15 

HI ft* Sephacryl S-300 ii 3 A*fflc*fc<?"JH8ii? o t h ^5 7 * -{C<k 0 
02 lis HyLoad SP it 5 HfcHK * h ^"5 7 -f 0 # 

S!3li> Sephacryl S-400* 5 ^fcffl^fc-irvUBa* nv h>'5 7 ^ -ir «fc 9 

# b $ *i tz & b # ( - *>• «* 4 3 h - a *j ix * a * ^ -r tr 5 7 -c * -5 0 
25 arc **w©ra> 5— egtt*wrs^>/<^HoBan^rftoffi*ia« 
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»8ig?I Mr a % Ba7;i/A'J4i t / 1D7 ->ft 

(Sephacryl S-300rt 5 Afc £') |-J: <5#Ii-f* 0 *©gttH#£, a 
*^D7h/ 7 7< - (HyLoad SPrt -5 Att £') K<fc <0 ftWf * 0 *©Sffil77 
£> 6 OHfcfcKKT^-^TiMrU *©a:IR*IMrttl=*JSU 7 ^ ;u 
^-SHBLfca,. YjU®&? dv h /77 -f - (Sephacryl S-400* y Aft £') 
•C$^<:^ii-r^o iSttS#*£JRU 8 HS D S - P AG ETMfrtS 4, & 
* ttl 1 OkD, «!5 8kDfcJ:Cf»!j4 5kD«:*JJC-#-*3*0/<>K*<** 
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ti<h~>tz 0 Ztl^tDZHmzm&t-fZtzlAlZ^ TRAP (telomeric repeat ampli 
fication protocol) £i? tfftS # 'J > 7 — trMRjEICS-* < 8lse£i£A<l!S3g 
3ft (Kin N.W. et al. . (1994) Science. 206. 2011-2015; Piatyszek B. A. et 
al. . (1995) Heth. Cell Sci: 17. 1-15) . J: tffc£B$p B ! 5<Z)F° , )SiJ$<&# 

bfrbtiii*io % wOTRAPftti, *U7?'J;l/7i KYd/««actt<f>H P L 
C * Hiz J: * 3 2 P -$fS5£g^3t7fc&!$En-:g^0#tfrm** £ LT^S £ 

Be £ 1 1 > -5 mm * W L T l > fc. 

*Stt*1fttiK ££-f LT. TRAP-S PA'fe^li 

n*;*i£;&<*8£$ft-cc>S (4WP8- 1 7 8 3 0) o 

SSTRAP-SPAMIbti^o 

nmm i : c h a p s mtuvwmm 

? y h 6E5t©AH 7 9 7 4JiJlS C§® : 4 0~5 OmL) £*#Lfc 'J >S5 
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(1 OmMOHe pes(pH7. 5) : 1. 5 mMOM g C 1 2 ; 1 OmM© 
KC 1 ; ImMODTT) xm^tzft. »£»U JiCStKl OmLOCHA 
PSfflKfi&lUISffi* (lOmMOT r i s-HC 1 (pH7. 5) ; ImMOM 
gCl 2 :lmMc7)EGTA;0. lmMOPMSF : 5 mMO 2 - / )lf] 7" h 

J- JU: 0. 5%OCHAPS ; 1 0 if ') -b □ -;U) KffiaBO/io *± 
II 3 0«ILfcg. 1 5. 0 0 0 r pmT3 0£IHfi<i>LT±«iS#*i»A:o 
:oi»li^>'^Kja«3 0rag/ml x3 0ml^LTl^ o COJHiaflfi 



xttfti i x'W<htLtzm^mmmznLx 3 i&t»co 1 mot r i s -h 

CI (pH8. 3) £flox.7t&, 4 0~6 0 %t&*05SK7 AS#*.#& 
L/Co ttt*T, COB»*2 0S<ry-bD-;l/*^ti'«ar»D (20mMOTr 
is-HCI CpH7. 5) ; 1 mMOE DTA ; 1 rnMON a -bisulfate) 
; 0. 0 l%<DNP-4 0 : 1 -frp-/U ; lmM©^>X7; i> » 

5~1 0mLK»»U wn^7^;U^-iSifi (0. 4 5*m) U/r 0 flfctifc 
**©*>'<*WB«J*4 2.. 7 8mg/m 1 x 7m 1 XZ>itz 0 

3 : Sephacryl S-300 5 «fc 5 Y ? n -7 f. y=, 7 _ 

*H«l2-e»4nfc«»7>*-'>Ai|^*±CLf:«Slf»D-C¥arftl/fc4 
5mmx 5 0 c mOSephacryl S-300 (Pharmacia) # 5 A T#II L £ 0 '<#£ti;t 

■a*#T®^*[TTRAP-S PASTSWrLfc. T R A P - S P Affili, x 

us?— m&iznnviKiHwo ztzbiz. *h£kohtrn 

a s et'ULt^i (RN a s e ( + ) ) £*&3I<D ( R N a s e (-) 
) ©M^KotN-c^JSLfco RNa s eS51li, S-Ii#cD-tf- VT*;!/ 2 0 m 

1 K*f U l^lORNase (1 Omg/m 1 ) fcflox.* 3 0°CT'l OftFa'R 

(TRAP-S PAS) 

TS^7-f7- (E?"i«i) Rtfcx?-?* (EyiiM2) abi 
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*><dko^ti±. ic ] ) zf? 9 i/Jrf- K© 5' ^«i:t'tf >LCt'tf >-ON 

™*X + ;U7 i ^-f h (Clontech) v 7»j V^SitS d £ ir J: »J £j£ Lfc„ 

*KS-3H-CfT->fc) , ft|g£ftU DEPC (yxfiHfto*-^-h) 
L tz 7K * «: S® S $ -£ fc, 

0. 01~0. 1 u g©h"*f- WfcCX 7*5^7 - (Biot-CX) ^-Hot-Start 
f-a-r (CIBC0-B8L) CD 7 -y ^XjfTC h5 -j 7*<*1+7to 

RK* 2 0 u 1 OSRtttia^ CRN a s e jQMLtz fecDX{i*4aSO feo) £ x 
2 OmMOT ris-HCl (pH8. 3) x 1. 5mMOMgC 1 2> 6 3m 
MCKCU 0. 00 5%OTween20, ImMOEGTA, &*50//M 
Od ATP£.tfd CTP, 2^MOdTTP, 5 0 M M© d G T P, 2 u C i © 

[Me— 3 H] TTP (Amershan, 114Ci/mmol) , 0. 1 # g/ju 1 
©BSA, 2U<0Taq#'J>7-tMW:, 0. l^gXIi^lOTSy 

°CT-9 0»i02»U »^T9 4°CT'3 0#>. 5 0°CT3 2°CT4 5 8> 

Sltlft (4 0/i 1 ) £9 6 7^1/71/- h (Ballac) C^t, 5 0 u 1 ©X 
hl/rh7t*i;>»a<!>i/J\tt?7Jl/tD7'f^DX7^ 7- (0. 56MCE 
DTAt, 1 : 4<omiii) Zljaz., 3 7^CT1 OftfiJW h Us t'^-f- 

7 b- Mis HicroBetai/ >-f U— •> 3 (ffallac) ±T'77 9 > h 

t 3 H -« 0 &*S £ ®Hft£BI 1 C^fe 

7— b*Stt**t;iB» (fctij&Sti 3 15~375mL:Ell:ti©75 
^3>ft21~25) t»«Lfc. C©H»0* 1. 3mg 

/m 1 x 7 5m 1 T*o^ 0 
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: HyLoad SP77 5 -MCcfc * a -7 h ^^7 -f - 

*«E«3-C{ifcy4/iiajagttli^7 5m 1 *8ffittDT¥fir{tLfcHiLoad SP 
ijv A (Pharmacia ; 1 6mmx 1 0 cm) \Zfrli^ 0. 0—1. 0MCDKC1 
T^^> h?$tilL^o *n*'n<DIj»£fgflM3£H«CLTTR-AP-S 
PA£i::J:*)*MrLfco Sli#£ 3 H-iR9iA*M<!:0&l&£E]2K*-f o 

xoy 5— fe'SttMgii&ttSgjO. 1-0. 3MOKC 1 l~<kOi&ih&tiZ) 
Eft (a2*«)77j->a >«4 3-4 7) 2 7. 5raL^SKLfc. 

fc 0 wt^T> ftlB*4mL©2 0J<^ , J-bo-yl/*#t;aiittDK«JWLfco c 
O^t©^ »<*Ha§fltli 1. 0mg/mlT*,fc o «l*-C\ C<Z>tS&£7-< 
/U^-ffiig (0. 45 a) Ltz 0 

Xffiffl 5 : Sephacryl S-400 i» 5 A \z «k -5 ?dt^77-(- 

**«4T?»fcHiLoad*5A«f«j«iS*tt«lrttD+5 OmMOKC 1 Tsp*Hfc 
LA: 1 5 mm x 7 5c mCDSephacryl S-400 (Pharmacia) ^JCtV® L 

fco *tt*'*l<DiI#£flife0iJ3 ilSlfilllLTTRAP-S PA&KJ: t))||fL/: 0 
&Ii#£ 3 H-&9&*fi£©il8ftsa3K*-f, 

xpy 5— l?gtt*£t;iB# (EI3*cD7 5? > 3 >#^2 2) 3m 1 
L/Co 0. 2 8 9mg/mirft-,/: 0 

Xffifl6 : SDS-P AGEfCkS^tfr 

3*M5fl5-C»fc?n> 5-#rStt*£t;li# (03 *C07 5 * •> a >S^2 2) 
8-1 0J«Dy^fi«%SiNtSDS-PAGEi:J:5iMfflfco 

*©ISS, #J4 0~;ftl 2 0 k D©n8iraa©/<> K#tfctiJ$*u #j 
1 1 0 kD, ft 5 8kDfc«£t>'#j4 5 kDO><> KtfSfK f*tb$*ifc 0 

-t?RNA©B£ 
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(1) -797. -f n> 3--t'RNA<D9 n-->r 

House Liver Total RNA (CLONTECH. CATALOG #64042-1) £7-X 
iU RT-PCRSi:<tt)7-5 x-rnj« 5— b*RNAifie?*iiSU p G E 
M-3Z f (-) t^^-OSmalt-f M:ty>n-i>yLfc (pGEM 
3Zf/mTR) , «i»fc5* &tf3' 7v 4 -e-tt, ^n-en, 
5' - GGG GTA TTT AAG GTC GAG GGC GGC -3' 
5' - TTG TGA GAA CCG ACT TCC GGG TGC -3' 
Tfc* (K?iJ8^3fc«fctf 4) o 

(2) 7">7foy5-l?RNAOM 

BamHlT^Wb/:pCEM3Z f/mTR (T 7 R N A * 'J ^ 5 — t? izj: 

!2 P7^a/:fo^ 5 --tr'RNA£. Riboprobe Combin 
ation Syste!n-SP6/T7 (Pronega) £ffil\ ffifflWESKfi^aS! Lfc (£fi££*l 
fcfpy7-t:RNAi:i^ 5' *«iS«tO f 3' %m^9 * -Ef>JKS*-T5 
ggGGG CGA AUU CGA GCU CGG UAC CCG£. AGG GGA UC£#*ft*'fttfiDS*l*) 

o 

(3) SDS-PAGETSMULfc'O K^cD-rny- 5— fcfRN A«D*S£* % Co 
llins (Cell. 81. 677-686) f=«-^TflWrLfc 0 

i§i^xn> 5— t"stt*^trii» (Ei3*©77 * ■> 3 >##2 2) znmm 

6t®miz8%SDS-P AGET'ftMLfco 9 >'<*K«D«*BS U£fT? gift 
T\ SDSy;Mfc5 OX.* *y 5»IBU ttl*"C 1 0 y 

4 UAio yvt/*5 0mM«DT r i s-T-tr^-h 

(pH8. OK 1 OmMcDMg C 1 , % 1 OXO/'J -tr a -;l/T'¥£HI: LfcflU 
"P^^UfctD^ 5— tfRNA£YeastRNA (Sigma) iitllftlifc, - 

*<0£g, »4 0-ttl 2 0 kDO|BUcaa©/<> K*<tWJ*ti. ft 
1 1 0 kD„ ft 5 8 kDfcJ;^4 5 k D<D'<> K*<&< 8Wj$*ifc 0 

aafc_kg>»lffl05T<l6te 
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e n s 

E?«I£^ : 1 
E?J©g$ : 1 8 

: -** 

E^JOSS : ffeO&K (^SEDNA) 

: 

ffeCQtWK? : TS Primer 
EM : 

AATCCGTCGA GCAGACTT 

EM#* : 2 

E?iJ©g$ : 2 4 

tg©& : -*« 
Mny- : 

E?i|<Z)§g : ftfcco^K (MDNA) 
E?iJ©#8fc 

ffeOtefS : CX Primer 
EM : 

CCCTTACCCT TACCCTTACC CTAA 

E3*J8^ : 3 
E?iJ©g$ : 2 4 

w.nv>w. : #K 

«0« : 
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CGCGTATTTA ACCTCGAGGG CGGC 

EMM : 4 
EMOfi* : 2 4 

E?'JOS : 
H©£ : 

EM®«H : ffeCfcS (nfijcDNA) 

mm : 

TTGTGAGAAC CGAGTTCCGG GTGC 
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IS * CO ia SB 

: 

*AM*r4 : 

tf^fi: SDS-PAGET'IiJSLT, i&4 0~#J 1 2 0 k D ; 
*«ttft : R N a s ettaJCJ:*)7gtt<b$*i« ; 
: -7^7 7.x 5--e'RNA£*l&-f S : 

2. ray 5— b*IStt*ffr >'^K©»?»*<S DS - P AG ETiflSE 
LT> #]1 1 0 kDx »5 8 kDfcJ:Cf/4tert*!i4 5 kD-C**wi*4*»t 

»*©igBBSliIEtt©*£* 

3. TK©3fc¥WttR*Wr3C<fc*1**£-rs. fn>7-fgltm 

ftfflfcJ;tfgM#g& : Jl«[*»0!teft(*©xDy 7DNA3' 0H*«O#fi 
£te£-f * : 

: SDS-PAGETWSOT* ftllOkD.&SSk Dttzii®4 5 

kD : 

*HWt : RNa s e»ai=J:07gtt4k*n* : 
*©*>'<*»„ 

5. ffflssflsai^*, k»t yvHta^o-e n 

sa-r^wiicckojf 4ctA<-e**w 11*111^ 4 ©fan* 
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WO 93/07838 PCT/JF97/02904 

X.lip SV2 n e o, p SV2 gp t, pMTVd h f r tt £*Jfl^T— WBfll 
!JM*BK v - ij - * *^**-k:J:*> L fcitfllliSjKt 5 fc & 

li, S V 4 0SJ#j7*o ^t4 i Oi9-/Df>3SeK4*t3X7'7 
-Y^E^JDNA, 7**©£-^n t">fi^fr*©#y77*:=.MfcSH4, SV4 0 
'JTx-MtSMi, 3fctfKpBR3 2 2 &#©$gi$j££fc <t 
Cf T > f-> 'J >IHfciBe* pKCR (Proc. Nat 1. Acad. 

Sci. USA, 7 8, 1528, (1981)) tt£#*tf*n*. 

38R'<* * -©S&MflS^fiD&Ali 'J >a*^-> n^fc ationic 
1 i p i d£DNAO* y 'i7i Uffll^ h 5 >*7 x ? > 3 IftWC 

ft?:i*Ti4. UTtt, MEM, RPM 1 1 6 4 Ofc<f£jfHv 5~ 

1 0%jilir&??&T$> L < ttaSS©-f >->»'J >, 7 s ** ^ ■*•/ >, h v 7 

£?ov h^^-ICiBfUSl-rtg-C**. 
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■fa*-9-t\sT\L *m&. 7 7-^£3fE0t>©. 0(1* M h 'J 7* h 7 r > 
-S-fitS* (trp),7? F-x^-^d > (1 ac) . A 7 7 -vP U PR, 
T 5 7 7-V<Z>mm&fc I P<?>7'a : £-9-?&Z? 2 5, P 2 6 7°d*-*-^ 
tf^tf £tl£ 0 CttiiliflRtfp a 07*0*-*- [Ag r i c. Biol. 

Chem. 52. 983-988. 1 9 8 8*] © J: -5 K&S«=3fc£, IStfSn 

D j|? 7 - AM-bE^J £ L T li , 7<8§M. 7r-*l£S3fc0t>©Tt>«J:^A<, D N A 
^•JSRK iOffJfiLfc 1 6 S 'J +*7-ARNA©3' JfeMttCffilff ttttEfll* 4 

i&m&i±mm l t *> o =» > -t >^ xe?ij^^ ^ *> © t t, * i > 0 £3i&*£ge?ij i* 

»teH-f©«-c**i t*<2* U\, Sfc»3l'<**-±©SDEW£K«I1M& 
^^-iCQgflflfln-^^tii^^S (IfM^l - 9 5 7 9 

T, 7*nt-^-OTgiJC0lJxlft^^v>>^6fijy±^^T ^ yBEyipfea-. 
K-T5DN A*fEttt, $ *K*©T*K:OTattxa> ^-tffifiKSfr?* 
J: »K t > ; / *ttfS$B«f d ^ 7 - H?S6f Xlit 
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*!/- h *5 'J^^f- K*3tf60^orT-4?-CM-r 

*@lR-r K i^T^i. -faf-T— if CiiWKftttH PL Collet 0^ 

±E©<6. Xa^^-iLtltffl'CJSfeOiLT, p U A I 2 (#B§¥1 
- 9 5 7 9 8^$S) -^rfilK© p K K 2 33-2 (Pharmacia#) ^£ 
glfiZttfTZZo tils B#£jfo©£fi£$Ol' S - h 5 >X7 x 

LT3§3l£-tf S&m^? * -i LTpGEXf'J-X 
(Pbarmac i aft) *?'JJS "f 4 - <t t X* Og£yiJ**iJffl Lfcfg 

K*<pIIBtt^^ LTp P r o EX- I (G i b c o BRL) 

K& L"Ctt, ffiAtf I n v i t r o g e nttffl'^aO^WM* * b T* 
hiv9X'<?9 (MAXBAC 1 ", B ACULOV I RUS 
EXPRESSION SYSTEM MANUAL VERSION 1. 4) 
87ia7iH:ii|l>, Z<D*y h £{gJB"f 5 w £ *<T « So COBf % $HRft*± 

gfigt LTIi* 2 8°C~4 2X:A<iSiTfe5o 5* h-X*<o > (lac) 
<D7a*-9 -%mMTZ%i£te. 6 0 0 n m © ttft K OR jfe 

0. 5i:ttofcil5?, *fiK*<lmMgKlc«*J:'5C I PTG 

7f-K*JBl*T, t7-;, 7-;K -7$* '<££'© tt?L£Utt&£ft 
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PCT/JP97/025O0 

»«^ft>.t7ra-7 (Pharraaciatt)^,n. NTA . 
7*fo-a (QIAGENtt) ^© r^EWJ K4MttKtt**-ay#:,K* 

©as • arm r 5 c t *<t * « . u± © * ? tt ^ &T?# 6 n te + D , _ _ 

l*fc*ft r7-«^.j „ K CTwo-hyb r i d) fcj Kff^ 
ttJBffSfcttfcffil^ il»7D^-ifISIi:ilfi(|i; 

.fc'4ttBttO*:*ICf^ M^tfC 1 on t e chtto TMa t ch Make r 
*fej=«ttitf, SMBQttl &V^Tns 7-*rZ&a&<tKmmtzm&R 
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WO 98/07838 PCT/JW7/02S04 

7"n y ifcli^'J =f? ? K • ^7^v-^l^PCRSi:J:»)*i] 

SDS-*';7? U;U7 5 K««*»fti::J:*a*Sa<tt2 4 0 kDaO^jgtt 
fi#'J*7* Kt> SDS-^'J7?'J^7; KWaijcilfeKiSfl'-fStftt 
2 3 0 kDa<Dgttfi#»J^^f" KOffSsWHSSnT^So *>-x ft2 4 0 kDa 
©^SttS^'J^r^ KtfMKJSRU #j2 3 0 kDa©jSttffl*»J K 

±iao/£tts# , j / <7°^ Kftweita*y^^f- K©#«ft*ais-r*c£ 

2 4 0 kDaC#H7f Kift2 3 0 kD a<D#'J "*7f- K iOffSJt** £ 
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WO 98/07838 PCT/JP97/02504 
CD-^T'J.-y hOfl-T-fifcTOlCfflJEU ft 2 4 0 kDaO-1!Hyf K£#J 

2 3 0 kDa©#y<rf K®#ttJfc*»£-f 5o ufc»ia<pffl 
ft 2 3 0 kDa®SSff®#ftJfc:W*mLTl*ftHN ttttWtt 

CD 7h7t^'fDy 7 -?- Fp8 oiaef KfflHttae^ott 

Internet JITx National Center for 
Biotechnology InformationOhome p a g e i: 
T?-t?XU TBLASTN^D/vAjlT. f t^t • f D^7-tr • f 
h p 8 0©7 ; > ftffi3F(|i:HBB«7 i Ji&WM**- K LUSDNASS 
J'J^^^L/Co -S-Oife^ Expression Sequence Tag 

(EST) DNAEHO-F-^'O^CfiftSn/:* 7 9 h P C 1 2 *fflIS&3}E© 
«tB*WttmRNAE*Jlcffitf«ttDNAE*iJ (E*Q£oE?iJ##3 ) p80 
© — SB® 7 ; y ®EJiJ{Cpl>ffi[gt4 (High S c 0 r e : 9 4 , 
Probability:!. 7x10 " 3 ) • / W&M (TIBS 1 :7? 

hcDNA) KW#*c: 4 fc (St 1 , 7 = y^Ji 1 

U X listen K>*ST) . 
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P 8 0 (N*JMI) AVX I R.N EL. 

, v hcDNA (N^fflO) XAS L Y ARQQL 

P 80 YIRTTTNYIVAFCVVH 

5, h c D N A NLRD I AN I VL AVAAL L 

P 80 KNTQP.F I EKYFNKAVL 

7 y h c D N A PACRPHVRRYYSA I VH 

p8 0 LPNDLLEVCEFAQVLY 

5/hcDNA LPSDWNQVAEFYQVWY 

p80 I cc*««> 

5'/ hcDNA L (C5|?jg«iJ) 



(1) T®£ftfcp 8 007 ; /®EyiJH«g»TPl^|^f4^r>-r7 y h£&© 

K1-5mRNA^HIgi:#S-rst>0^<lf?^B^"Cfe5o p8 0i:«H 

^JCgS^nfcDNAEJiJKfBffi&tjttDN AE?iJ*<CC!)7 i /8ffi?iJ£=>- K 
■rSaltltt^feO. ^&¥3*ifc*h£:©mRN Alt*-??* •> >r£%ux, 

£-f* 77^ ^^l/XTlISll*n/:7 v h 3 Y l*fflB&&3EZ 1 9*EBBs^<i. 

22 



WO 98/07338 PCT/JF97/02S04 

Chomczynski <&~fiik (Anal. Biochem. , 162, 156 
- 1 5 9, 1 9 8 7) KJ:,TRNAtB8U. -fttfc*., Z 1 9&BJS1 0 8 
/7ii?>-f 7f * 5/7* - Fjgjft [4M^7ii?>'f Vf * j/7*- h 
(fnftUfcg) . 25mMn>®tl>^i. (fP3fe*fi») , 0. 1M 
*/|>x^y-;K 0. 5 %V fro h y >}A (ft£*tiB) ] +T**v> 

t-fXU 0. lgfi02M»h l J')A (p H4. 0) *Mx.Tti»L^o £ 
©**S? a *- h £*£S<D*«»ft:7 x y -A- (ft&ttS) £c/0. 2ggco? 
ou*;l,A (ft^ffiH) Ay7;A7*3-A (ft&MBK) (4 9#K 

ttSfifc) *in*Tl OfMB*L<iIftU 10, OOOxg, 2 0#H©Si>fl' 
»i:J:0±J&0;!kJB*lHlJRl/fc. @JRUfc*JU;Sp$fi©-f vfw<j -;u (ft 
ftMg) *fi»U -2 0lTl^TOLt^ 15, OOOxg, 2 0#IW 

KJfflBU *«Ji©-f V^DyV-^iDX., -2 0«TlBflHJfc*Pl,fcSL 1 5, 
0 0 0 x g, 2 O^«O3B^«m:»}|6RNA*0iRLfc. 
RNAOf&KliJsJTOJ^fcfTofc. -Tttfcfc, 0. 2 m g(D{|RN A* 1 mM 
EDTA, 2 0mMF<JX|gB (pH7. 5) 7 0t, 5£>M0lft 

«4 J: o KflDX-T, 01 i go-dTt^D-X*7A (type7, lcmx 
lcm, Pharmacia J±) KRHU ImM EDTAfeiCfO. 5M 
N a C 1 *#tf2 OmM h V X&KB&tt (pH 7. 5) T?* -3 i»*ft»a, ft 
arBWa*»-Tttr'&5'a**fflLT4 ju g©p o 1 y (A) + RNAm o 

±EOJ:? C bxmtlfzp oly (A) + RNAl7-f?D g^HSSir LT 
c DN A££-fiEt/, :®cDNAi:l Opmo 1 e©7 • 's + lJ-v- • -f 

7-f7-i2 00iiv hOMML VifiCTWR (TSUPER SCRIP TJ, 
G I BCO BRL) *1DAT 1st s t r a nd££jj£U 1. 4a 

iyhORNaseH, 40J.-»yh O^SIDN A-f 'J y 7-t' I S^l 5a 
-v hO*»SDNA5-ry-X*Hn^.T2 n d s t r a n d££jj£Lfc 0 £ 
7 x / d D+;i<Ato{iJ£fri^ ±»0*JI*lslJRLfc, HI ft 
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PCT/JF97/02504 

.1 5, OOOxg, i O&mcajSt&a&tff,,^ 
tt»K J: £ c D N A ©SUlX-S-fr fc 0 

±EOJ:9i:tT^infccDNAi:oi^T, Ri 1 e y 
(VectoretteS, Nucleic Acid Res. , 1 8, 2 8 8 7 
- 2 8 9 0) *JBHT, Q) (DXm-emtltzcDNAmW (E^J#^3) 
f *K 5' «±*Kftftt«*»0cDNAffiyiJ*j!WL*:. If, 

6 0ng©cDNA*T4*'J^7--tfTMSLT^«^iprt-fkt, Sbi'l 0 
hoMiSUPvu I I *«*l**tt© ±37 

aS^x^y-^tt^CT^L/^, T£S2i:^tVec t o r e t t e 
unit (vctAivc tB*7WS*fcfcc) 3pmo I e£DNA'J 

£2 

vctA:5' — AAGGAGAGGACGCTG 
TCTGTCG AAGGTAAG 
GAACGGACGAGAGAA 
GGG AGAG- 3' 

vctB:5' — CTCTCCCTTCTCGAA 
TCGTAACCGTTCG T A 
CGAGAATCGCTGTCC 
TCTCCTT-3" 

Vectorette uni t * c D N A*»g£ L 

T> Tl£^3 Ic^-fV ectorette unit ©K^Ht: /x^- 7 g ^ x 
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PCT/JP97/Q2$04 

t5vc tG*.^^i/*fK.^7-4, B3!W3i:*tcDNAE 

JiJi:Mry^xt«RaPC5' * -j ? u*^ K - ^-i*^ 

fcPCR^m^ RaPC5' *yrfjr^U*f- K- 7? 4 
PCRfl*-T^*-f^9-*JB^-C9 3«CTl»ni, 6 5«CT1^ £tf 7 2 



vctG:5" -CGGTACCGAATCGTA 
ACCGTTCGTACGAGA 
ATCGCT- 3' 

R a P C 5 * : 5 ' -CATACCTGGT 

AGAACTCGGCTA — 3' 

-8tDNA'j/r-tft^r P T7BlueT^^- (Pharmacia 
tt) JBKfca*ftfcfc*a*:fcttJ5£7>^;,y >-?«ftLT, ^7 

X5 KDNA*W«Lfc 0 »A* ft* P C Rt«OD N AEJlj* A B I 3 7 3 A 
•>-*x>ir- (Applied B i os y s t emsfc) ^MSanger 

*S KRaPC5 3C#A*nfccDNACfl||jihfc, 

R a P C 5 3 OttSEJFtefftff L fe*M DNAHMS n*E fl* 

OBinscEttemMi.i 7OST0JgiE^ *Bo? 7 haft 

T(iEnSeE?!**4 eftM* 1^ 2 4 4 * Tfl>"*»E5«l=#JC U C , 5c 
**B»**ifc. En«0EWt3ett|ftl 6 3-1 7 2*?ott*E*l 

(5- -TCTCTCCTAG-3' 1 ) Ahp 1 icing acceptor 
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s i t e©3>-tr>-*x@E?ik 5' -PyPyPyPyPy PyNCAG-3' 

3WJ3icEtt0«tt**i 7 0<O CT) li*«CliEfll#*4©e3n]lcfcOT 
li CAi ittoTtO, Rjt3K>TAGi« l jy>AAGi:4-jT^fc. Lfrfc 

• U—r* 7l/-fi<D7; yttffiflltt* Xg(l) 

fiSnfcf h7t>tp80O7; yffiEfllKWHtt (High Score: 
94, Probabi 1 i ly : 1. 7x1 0" 3 ) ^15 y hfi3ECD7 = /®?BE 
yijCJt^T. SfcKKlvtBBtt^jjjVrfcfflT**) (High Score : 1 2 5. 
Probabi 1 i ty: 1. 6xl0" 1! ), @£?"Jg<3B£?iJ#^ 3 <D 3 1 2 §g 

ffl (a) a< m itt-jt*!), 7 $ v @&a<7X'<-7*> (aao * 

^VD^» (ATC) KKJll,Tl**C£*<WUlLfc. 
(3) 5» h • f a>7-tfSfififtcDNACtt!l 

tt, SV4 0'7'fWTlf?S^n/:7 7 K3Y1&5S5SV-3Y1-C6 6 
jHBtt*&* 11(1) ©^feil^S&tttffc-Cp o I y (A) + RNA£#. 
STRATAGENEtt©c D NA^jJ* y hfcffi^T cDNA^IIIl/:, 
c DNA©ISS!ili"7-*7;HC^oTff 1st s t r and^SK 

•T -7 - CD ffi#£&&iQ$£ 2 u MiO £ Tft o fz „ 

ftK, c D N A <F>%V&\Z. D N A 'J tf—V\Z± oTE c o R I 
Ltz'&, fcfcm%Si: Sephacry 1S- 5 9 0 * 7 l»\zWM U *5lc>© 
E c oR I 7 ? ~f 5 — b.'fy 4 X©/J>$ 1 ■» c D N A^Hfel 1 /:, ^il 1 ?®^® c DNA 

*x*y-;i,a:»?igjKu ^»*yR»*E c or iTiBftatu san^ia* 

IK 'J >@HtStt*: A ZAP7 7 -^DNAilE® cDNA£DNA'J #~t?T 
ISfcLfc. *6K, cDNA^l/fciZAP77-v>DNA477-^tt? 
^■f^-i^^L,/;, J£(JiOfp|||i STRATAGE NEttOG I GAPACK 
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PCT/JP97/02S04 

GOLD I I i*„ SftO7_ a 7;n:j$,Tff,/: 0 »*ftfc7 

r-ya-f- *tfftK«l*7C»»C 6 0 0 h f 1 A«sir«jfe**Tlg«*m^ 7 

ftl 0 0 3JO7r-^;o->$#ifei:^ BB c 6 0 0 hf 1 AtftciBft 
?LJS,fcfc©iDl/^jA$2^S!U 8fe»atf7**'JMl,fca, X 8(2) 

»tt->r^**fifflLfc©T, *ft*Ko^T7r-*tt*£|5|JKU HtRtt^ 1 
St^D-WtL^:^ S t r a t a g en e t©7_ a 7yH:S£,Tlf 
fccDNASC^t^X; K (RET K RET 2, RET3) £i n 
vivo excisio n & K THJJR Lfc 0 
7*5*; KRETK RET 2, R E T 3 C^l*TttH»*ttWifiH*ft« I, 
3kb P> 2. 4kbp, 6. 5 kb p®cDNA#J*A3 
ftTfcD, Hli:35tJ:7C#«<DcDNA4<lfir5ttli:*5;i^, 
fco Mi:fi£oT^IcDNA^8}U RETlffl^tRET2S^ 
R E T 3 ©-§&#© D N A @2?>J£B?^ L/c G -C-^l^DNA S^ttKBftttffitt 

acse.Tfi^btffci^, # 4 . 6kbpi=*>fc4 7 c#tt*-7'>. .j- 

^^W^llH^fc, CO + Kli, Ii(2) T»*ftfc7* h7t 
^^p8 0O7 5yaBBMi«Htt*^1-RaPC5 3fflT5/aEyij (High 
S c o r e : 1 2 5. Probabil ityrl. 6x10 ~ 18 ) *>£-£ftT 

*<k t>tD^t-fi :< t(, fh7k ^ p 8 got i ywrnntntt,^ 

^ « H tt * ?8 * * k tt * - ( H j g h S c o r e : 2 3 4 , 
Probability:!. 1 x 1 0 ~ 49 ) 0 

UfcmRNAOytfvfiWf©****, »*ftfccDNA0S**MIIR©inRNA 
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PCT/JP97/02504 

©®f#*i***:, fa**, RET303' *Kfii^ ElttOEmiCjt 
rDNAE3FIJO-J%tt»^4 0 8 3-5 2 1 6 i:*fc««^DN ABrH-* 32 P 

(REn 0 1^ 0 7 , 0 8 . 0 9 . 1 0 , 1 3 ) * i n vivo 
e x c i s i o nj&KTlOJR Lfc 0 

r57iKRETi0K RET A 0 9, RET A 1 3 C-3l<TftlHBSttBr 
«B*fWLfc4:*,#*5. Okbp, 4. 9kbp. 4. 9kbp<DcDNA 
##A*ftT*D % B 1 i:*t «t ^ i:** o c D N A *<Hlt « fitKi:* 5 ; i 
#*MWLfc. C4a*©?t; % RET A 1 3*«rfcjr TRET7J itftSU *£ 
CSE,t ^* c D N A . R E T 7 <D±&<D D N AE*J*fftt L iz 0 

^7X; KRETK RET2RtfRET3**»Sftfc D NAB 
n»««t*M**fc*fct: 4> »jt3 K>H»T7 8 9 0 bp ( :b/;U 

(4) 77 f • rn> 7-^IfiIcDNAOTO-iW(D]5!f (5' -RACE 
&) 

XS(3) T»*nfccDNAI=li, Sfc5* «(0 A T G «fc 0 fe $ £ K 5 • 
H7l/-fAO» J h3K>**ai*tti^a6 % dries' fjOmRNAOEWC- 
-Rapid Amplification of c DN A Ends 
(RACE) fc*JH^TtMtLfc. 
5" -RACE&li, Clontechti©5' - R A C E + 7 h*JBn s v 
- a 7W:«ifl, fci lg (3 ) Kfc^TS V - 3 Y 1 - C 6 ********* 
Poly (A) + RNA2 M i, E^£®E9JS4 1 OffiM* ] 4 9 3 - 
1 5 1 SCSBfl.JcfiMWttDNAE^O^yrf^^u^^K^-f^-NcEXS' 
1 Opmo 1 e£*ig£u JO»Lfcf*C**l,fc 0 
(G I BCO BRLttOS uperScript), gjf? ? u*^ K ££ 
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mitO*.?* 2lCTH*flBfilS*-tf-fc. EDTAfciDfcTfiflteffihStffctt, 
7^*7 y*agT$l!RNA£#8¥U -f VT'p/V-yl/zt^ffoTmmc DNA 
&JH1BI,/:. 3*1:, :<0cDNA(0#ii:, 5' -RAC Effl7># — 7°-5-l' 
v- (5* -P ( + ) ANCJ 4 pmo 1 e £R N A 'J £fi§t>T:Ii&2 
fco KXSttx 2 5 96PEG#£TK3 7tt3W^>/:„ 
fcfc, NcEX3' TffiCTr^-f A*ft H $$IC3' *CT>*-DNAE 

w*#iiJ3tifc¥(tc dna*«mh£ it, T^a-DNAfcHttWtt* y 

K^-f7-RACE-PRMt, E?lJa<Die?lJ#^ 1 ©&&#-5§- 1 0 3 9 
- 1 0 5 6 ®0#t:;fSflttttt D N AEW®* «J * 1/** -f 
RaPC5' i£fflOT, P C R K «fc 5 D N AJ#«*fT-» *o KfcIC I* 2 0 £«D 
lg©J&gic DNAi^« 1 Opmo 1 e©:/?* G I BCO 
BRLttOTa q# y y 5-**JBl*T»{*© V- a TA/ICftoTP C R*fTo 

fc. fcfcu ^M**W«DNAitfll*atf*fc»i: % giftt-7ia7;u-*7 h ■ 

7s9- hft-CMttLfca^ 9 4 < CT3 0S\ 5 5 TJT 1 £\ 7 2tT2^0t-f 
3 5S^ifit^. 

PCRl*tpT7B 1 u eT"5**-Ktt*i&** jfffD N A ©# A $ ftfc 
fc©lcwcDNAE79*JIM*LfclilL :n^©H© 1 0 9 u-yftfttm 

i;DNA@syij£Wi/Tc^o :nb©^D-^o^, ftgftfc? o- >t&* 

RACE 3&tfRACE 5 (±0 1 K**«k ? tttttKglE LTfc >K E?iJ£©E 
*U**1©8«»1 9 9-2 0 lOATGOS* fl»±^2 0 0 bp*Tc DNA 
jMME*Rtf#ft**lfc* C tfrt>frr>tZo %fflm<0W,n%^ 1 ©£»•«§• 1 9 9 
~2 0 1©ATG«£D 5' IH±8ltl=tt, E*l*9 1 © 7 U -f A £ £ -5 fcjfc a K> 

i*JiHj 3 ftfc* W$^^DNAos$*<ujfisj— t^s: <ba>£ N g 

KOmRNA05' J&IZttflS"*-* c DN A*f»fciJ16tt;6<«i!£#*. fcftfc. 

(i) t h •■?o/7--tfaaffae*©»fl'iffr0*ff 
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PCT/JP97/025W 

Sdegenerativ e P C R 7 5 -f 7 - * C £ K «fc $ , z<07^ 

-fv-^ffii>tPCRfti;j;,-cf h?tjt1- + ?y KJ^©#IMlg@W©T 

**\ t^^7^7-tLT, BE?»J£<DlE?IJ#^lCD7 = y®#-5|3 7 9~ 
3 8 4<r*fj£-r *HPET5 (EJU«OffiyiJ##5) , 7 >f-fe 
i LT, IEjya©§eWf 1 ©7 i / gg#^§- 5 3 2-5 3 7 * H P E T 3 

(Eyijs®ie?ij#^6) mmm 1 ©ia<3) -5 ,fsv-3yi 

-C 6 6$ifl&E&#c DNA&tfH*n£#arctt»**ifct h#ftlf«B£&P A 

- 1 *(8ISEfc3jE c D N A t LT P C R *#&KTf? P A - 1 Jnj»& 

*cDNAOTMiiLT0SV-3Yl-C6 6lB»a*c D N A* * t> 

t >x/7^ 7-i LT\ E?iJS0ie?iJ#t 1 07 $ y®?##3 7 6~ 
3 8 5K*tt£T*HPET 5 - 2 (i2JiJS©iejiJ#-^7 ) i fcliiS?iJS©@e?i|# 
#1®7; / Kf^3 8 0-3 8 8 K*fj£-r*HPET5-3 (i2JijS©@£?iJ# 
*8) x T>7 t -b>x^7'f-7-iLTEyiJSog2?ijS^i©7; ;i§f 5 3 2 
-5 4 0K*f£-rsHPET3-2 (K?!l£©iS?iJ#4§-9 ) *fcl*E?)j£©K;ij 
##107; ygj?##5 3 4 -5 4 2j:*»6"r*HPET3-3 
##10) SV-3Yl-C6 6JfflSSi3!£cDNASa'PA-lMJ!S* 

aLfcy^UV^^U-^-TSILfci^, HPET5-2Jfcii 
HPET5-3iHPET3-2iOl^^TSV-3Yl - C 6 6«ld3 
*cDNA*»SiUPCR*fi,t^i:, fSStl^5 0 0 bp(DDNA 

sh*. p A- iaaaa*c DNA*»sd:Lfc*^jrii % 
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WO 98/07838 PCT/JP97/02904 
HPET5-2iHPET3-2 t<D&&&t>-£<Dzf7 4 -r-ZM^Z Z <tK«fc 
oTH*l=tt50 Obp©DNABffr*<HHBSnfc.- C ©D N Ag/rfr£ 
pT7Blue^7X; Ki:f 7'?a-i>rLTDNAE?i|^Lfct:5 < 

7 v h cDNAEWK»U**Ttfj7 7 96<D«Btt*#5* T I J Wt 
l/"C;i/-Ct>7 6K0ffiBtt*3*TDNAE5»i (12, E8I3 © E*!*^ 2 ) 

kt, ?#£ftfcDNAEjii©m*g£gic t h • 

c D N ASfK*^ P C Ri#it§T# 5 'J ^ ? ? K • y7^7-4gftLfc, 

•t^^^T-fv-iU, EBI«0EJ!I*-92CD«K*#9 2-11 4 ir^f" 
5hTPC5 (EJiJS©E?iJ## 1 1) , 7>f-fe>x77-f-?-£LT, WM 
£©E?>J#-t 2 ©&$#^4 3 3-4 5 5 Kttifc-fS h T P C 3 {WM&V>W3\ 
##1 2) &fa©fc HimRNAfiScDNA^SIIiLtMCT 

PCRfcfT^*:, 

tUSMSBBRNA. fc h Bullae Jlll^fiJSER a j i^Hafi^RNA, 
t hM¥±£$*&A 4 3 l&B&S&p o 1 y (A) + RNA, t h¥UBA£ 
BT4 7 4JfflflS, SKBR33fflfl&, BSMZHBia, S^MCF7tt^po 1 y 
(A) RNA^Ch omczynski ©2;j£ (Anal. Biochem. , 
1 6 2, 1 5 6 - 1 5 9, 1 9 8 7)St;Pharmaciatt©+7h^ffll' 
TSStff Pharmac i a fct© Firststrand synthesis 
k i t*ffll>TcDNA^lfc 0 

:tlicDNA(5i) 1 tt2 0#© l&tffiMt LT, hTPC5ihTPC3 
*/7-f7-tLt|^fcPCRitTot, DN A# 'J > 5--tf<i: LTte, 
Amp] i taq Gold (Perkin-Elmertt) £J?H^ 9 
1 O^-P^OfiaS©^, gS'CTSOS?, 6 5^CT3 0&\ RV7 2"CT3 0 ^i? 
©{S1&-7M ?>1>£3 5[i3g>)ilL/: 0 MSnfcJ&S 9 0 b p ©DN A 

»>*-*< fc h&«B8&E&*c NA^SISit LfcifciriiMBStiT**:*, ftS (-) 

31 



WO 98/07838 

PCT/JW7/028M 

Cffllt*, hTPC5ihTPC3*r7-f7-iLT«^ntft h -tv/ 
7-tfS6«cDNA«>|-*i»«T*4Ct*<*lj W Lfc©r, C 1 on t ech 

tt§2t mw*cDNA7'fr7';-<oi$, i o^-fi©7 

1 OOJflK07r-i>*«faiLT^fci4i:fa S ft fcjt43fl)DNA 

© c D N A Jf ASBfi© 5 ' flij£tf3' fljKtfjfc-rs 2-3©* y * u** K . 
^7-f7- (#^ 5 ' Utl 0&tf3* A g t 1 0) (C 1 on t e c htt 
«) ihTPC5thTPC3t^T-4LT«K hTPC5©5' fflfl± 
«t^/={ihTPC 3 ©3' «TaE©*»0«^©cDNABfJt©ai|*K*fc e 

cDNA^r 7 'j-®^i oo^i«7 7 -i;*i»aiu 4iio®y^ 

^-©|fi£tf (hTPC52*5' Ag t 1 0£/cli3' A g t 1 ORU h T P C 3 
#5' ^tl0$tli3' igtlO)©PCR*±Cfl)7jfti:«oTff-»fc. 
*:«U7i-;Ufi®i6 5X:©ft*>5lc5 51C»CLTffofc 0 *©*JR % hTPC5 
©5" ffl±SMijl. 5 k b pK#I&*S»#©DNAR>i-#Jiff*hfc. 
(2) h h • raj 5— tf»e»3ifi^ftc DNA©3m 

*1\ Ra j iJBISiayPA-l|fflB**i*ti|t,l«ffl^i, RNAzo 1& 
ffi (T e 1 - Te s t tt) ^TChomc zynski ©#& (Anal. 
Biochem. x 162, 156-159, 1987) i:j:98RNA*«ff 
U »6ftfcttRNA*0 1 i go-dTtAD-^* 5 A ( t y p e 7, 
lcmxlcm, Pharmaciatt) KftLT*ft*ftft 1 00^g©poly 
(A) + RNAtUfc. 

cDNAOMlCli, poly ( A) + R N A 5 u g *«FgCjgi*fc 0 
cDNA synthesis m o d u 1 e ( Am e r s h a m J±) KjStt* 
tifciSSCTBMf. 'J*'^"J7-*H, *BSDNA*ijy 7 -**JBi s 
C^ii:tS,TZ$icDNA^U; 0 c DN A synthesis 

module (Am e r s h a mft) KgSft3tifcT4 DNA*y /-7--tf£ 
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2« fi «x^y-A,«*L*:. 15. 0 0 Ox 

g. 1 0^01^^, ^^-^flaiK:J:»JcDNAOIB«*ff,fc Q 
B«UcDNAt|»LT2 0 /( |(O«W| ttcWLfcft4 10*1© 
cDNA (ft2;zg) W:EcoRl7m- (SjKft ) 

20ffl«T4DNA^-*SJ6« T6 6mMh«Jx 
ttBHK»ftCpH7. 6). 6. 6mM M g Cl 2 OMtMX) . lOmM^f 
**l/-f h-* (DTT % ftftftX) . 66n,M7f;y>5' - = U>» 
(ATP, S IGMAft) J +T3 5 0JMk©T4DNAy^-Hf £ 
Hi:i6X;T2W04 i ^ hU 2 0 0 pmo 1 eOEcoRI 7 ^ 
9-*c DNA<D5fc$Kjj££L*: 9 

ft»ttSc^8ephacryl S - 2 0 0 * 5 A ( 1 c m x 4 c m) 
' = * MU lmM EDTAiO. 5mM NaC 1*^1 OmMMixW 
«ffft (PH7. 5) */|i, T , c o R I T^-tfljoLfcc DNA 

»«^cDNA*x^y-Afl:aTin«U 2ta*ft«tt, 2a 
lOIKfHfclccmiU. *6*l5**IW*EcoRI (»*) T ^ 

i:««'J>«ftUiZAP7r-i>DNA (S t r a t a g e n efc) 1 

EcoR ^^^^-^tfc±lS©cDNA (4 0 0 ng) 
16^T4DNA^- W n + T 1 8 *«-f 

*t£*it*: 0 Zt, lz> eDNA4»*| / fciZAP7,-j>DNAt 
Gigapackl IGol d (St ra tagenett) ^T7 7 -;a 

^ft^r-^t^^c 6 0 0 h f 1 AflEKBftficfJt 
«*T7r-^ WL fc,- a8ifta!| , lOOngCcDNA* 

^T^lceOOhflAW^^ ^-Honzy^ 
±?iMLfc. ^°>«i:7 T - 5 ;tt WLft , T:i/riJj! ,^ 2ttflEaju 
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a6ffcDNABrfr* 32 p«»LTrD-ritTffli^, :®ro-7i>^'j 

S t r a t a g e n ettO^i a TAKfct^ c D N A»fl-**t?r 

5*5 K* in vivo e x c i s i o n&KTEJJR Lfc, 

(3) ^iSb h ♦ fD^7-tfgafcDNA3' AITflCee^OKff (3' - 

R A C E jsfe) 

±EIS<2) VftitltzmRU AZ&imiZLTs Marathon™ c DNA 
Amplification kit (Clontechtt) RACE 
fti:«t5cDNAOlfi^,fc e «T©SJ6l-*l»T, *«DNA^7^7- 
Jix Ma r a t h o n TH cDNA Amplification k i t KBff 

AB I 3 9 4 DNA*jfctt*JBi%-c*tfLfc. fiffifc, 
Marathon™ cDNA Amplification k i t KgstfSft 
«fc d N T P t > Tfr -> fc o 
**\ cDNAiO^jr,/:, »«lfcpo 1 y (A) + RNAl^gi 
cDNAi£lS#^7'fv- x 5 • 

TTCTAGAATTCAGCGGCCGCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT 
(G/A/C) (G/A/C/T) -3' (5 2?^tfK) *iS{S?Sfi: 
T3 7TCT»SU »l*cDNAH*Lfc. ft 2ttttft££&Ot**<D?flt 
ft * *T I * , cDNAOHii:7/r^-^9^7-, [5' - 

CTAATACGACTCACTATAGGGCTCGAGCGGCCGCCCGGGCAGGT 
-3* (44?*U*f- K) t5' -P0 4 -ACCTGCCC-NH 2 -3* (8 
J 9 Vt* K) ] fcfcfil&ofifcjft l 0 0 1 *#fcLT5 0 ji 1 

t U £ttt<DWI££rc 1 # 1 *fflW:„ 

t h • f DN A®E?(la— gBiffifllfttt?? 

-CCATCCTAATACGACTCACTATAGGGC-3* (2 7 7:7 
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l/**K) J , tC^ICT a q DN A-f ^ 7 -t'^OTfjo t 0 fil6«©^fi 
£50^ltU 9 41CTl^H©-f a 9 4 t T 3 0 g-'fig. 

ON 7 2 1CT 7 ^HOO* 3 Ifc, &m 

«©1 O»01«5*PAGEICTfiW?Lfc. ±Eg««(0^ ft5 A 1 

ACTCACTATAGGGCTCGAGCGGC-3' (2 3 7;i/tf K) 
*fflt>1\ Ta qDNA*>J>5— tfT?Oit«fia;*firofc. EJE&®£g£5 0 
^1£U 9 4°C-Cl 7 >fSO'f >**^--/ 3 9 4*Cr3 0#»L 60 

0»H. S^6 8t;T5^||©^> + i ^-^ 3> ^3ot^^ l/m ^ 

&s KEftOl 0#©lg* 5 96 PAGEKT|WrLfc c 

^Bff^^^ltifgb^cDNAESfjt^IlIlRLT^L, T4DNAU*'- 
**«l'T^7X; K^-pCRI I (InvitrogenJi)©? D - 
->7»mztf ALT, #£ftfcfc*au** JM1 0 9%$:%% 

X-Ga I - I PTG-LB-Amp*^#*±KifiaLfcBttB-c. 
^X-Ga 1 «:«t*)^feL-C^^c>3oo^iSgi^,coi>T, tffcicfti^ 
5X> KDNA*««U »«ff*fr,fc. ««Lfc75z» KDNA* 

^^TcDNAO«[*IEW*jfcSLfc 0 3' #a*fl*<DttSE*|* 

(4) 35££t h • fD^-t'iafcDNAS' ff±2ftE8l©flm (5* - 
RACES) 

5' -RACEfcOfifcl*, 3' -RACBftKJUUTff,*:, ^DNA^ 
Ma r a t h o n™ c DN A Amplification kit 
tm$tlK754 v-£WttAB 1 3 9 4 DNA^Mi^a^l^ s 
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Uli, Marathon 18 cDNA Amplification k i t i:^ 
ffStlfc«0**!J:CfdNTP*ffl^fc. mmtLXit, 3' -RACESoS 
JttmWUZ. PffflC7^^^-^7'f -e-*ftiOLfcc DNA^ll^:, 1 SIS 
©iftiSlSli. t h • -ro^f ^--tfgfiMc DNAOSyiJO— SUiffl^ffl-jr^-r 
^-fc^C^' MmzttljaLtzT???-??** v-tftffi&ti, 3' -RACE 
ife©fifl?OliKfeffl^fc^7-fT- % 5' - 
CCATCCTAATACGACTCACTATAGGGC-3' (277*1/ 
*f-K) *Jfll*fc. ElS8li£fi*5 0^ 1 tltx Ta q D N A 'J > 7-* 

9 4t:T3 OfcNSx 6 0X^3 0#RB. 6 8X^5^0©^ a 
>£3 *;l>*TlS 2^CT7^-HO^ > * a > a -> XS. 

JS*»7Lfc 0 ElS&x fi£«© 1 Oa© 1S* 5 96 P AGECTJBUf Lfc. * 
fc* ±I2S££:^© -5 ft 5 A£ 1 £5 0te*9?U -e©5 m 1 *»fit LTffl^T< 

t h • td> 5— tfSSBc DNAOEW©— StlBfflWClliIi©*(»«B5l5c 

ACTCACTATAGGGCTCGAGCGGC — 3' (2 3 7 7 Utff- K) 
*fflt%-Cffofc. filtli^ 9 4ttl#f|C^> + a'<--> 3 >Oi, 9 4XT 
3 OtML 6 0*Ce3 OWBU &tf6 8*CT5#IHI«D-f >* a ^-y 3 >430 
t^Ajfo, «»l=7 2°C"C7^KO'r >*a^- > g >*ffoT££*fc7 
Lfc, Eifc&, ElS$© 10^©1I*59<PAGE ?8¥#r Lfc 0 * ^Brtf* 6 
ittflLyt c DN A*EJRl/T»»U ^7X; K^?-pCRI I ©?o-.=. 
v/tttitKltALfct** fc&ftfcfcfcai^-^^T^aHn Ml 0 9«£ 
»ff<E»Lfc. X-Gal-I PTG-LB-Amp*551*ttl±l=aj|iLfc»tt 
I?, ^X-Ga 1 KJ:»>fcfcLT^tt^3o0JBBe»fltnoi%T, 
ftl^ 75* = KDNA*H8!ILfc. HglL/c77* • KDNA*ffil*Ttftfr* 
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(l) fc h • fny ^--i/lSISe^Mc DNAOSfc® 

*j8l*TcDNA5-f 75'J-*flSfiEl,fc. CcD^r? IffiS® h T P C 5 

(E5Ua©E?ll#* 1 1) «hliu2£©hTPC 3 (E8l*aB3l#?l 2) £7*7 

PA - llBft^^p o 1 y (A) + RNA4fffc, BPS, «U& 1 O 8 
t h'JfA (pH4. 0) ^iDXTS^JLfco HitpM©* 
iD^T^t < fil&U fi'&£ftKJ:0±»©*Jl*lHiiKLfc. mKL^Kiir^ 

jfiO-fV^D/^-AijlftL, - 2 0«-ClBflB?frfl!Lfc«, »ifr#|***T 
RNA*®«Lfc. 

8RNA£lmM EDTA, BOmMF'Jxii (pH7. 5) C^iU 
7 0°Cx 5#flBOfl!i&9tt, *±T*&L*:. c©8JfceNaC 1 
#0. 5M<ctt£J:5rcftlA.'C, O 1 i g o - d THJUa-xfi 5 A (type 7, 
lcmxlcm, Pharmacia J±) fcftMU ImM EDTAfcit/O. 
5M Na C I Z$&2 OraM h 'J (pH7. 5 ) T*7 5 A£jfe?f> 

*«tt**-e»^B**liltTpo 1 y (A) + RNA£?#* 0 

:®P o 1 y (A) + RNAiMjStratagen ettOcDNA^*-/ h 
*El*Tc DNAfclMULrt:. 1st s t r a n d^li77^7-i Lt? 

Jg£-2 ^Mj0x.T^Tofc o T4DNA#'jy 5 --t?*ffll*T c DNA*«©¥ ft 
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ItZft'* fcttK: Ec oR 1 T?-T?-£tt1iaLtz 0 Sl^lfti 

SephacrylS-50 0 * 5 -MCjggg L, *££© E c o R I 7 9? 9 - 
it-fXO/hSl'c DNAfcl&^fco c DNA*x^y-;Uct©-c(slJRL, A ZAP 
7r-> > DNAICjfAL^o 

cDNAi^lfcil ZAP7r-v?DNA^ Stratagenett© 
GIGAPACK GOLD1 I 1*7 7 r - i>tMP-\/< „ v> 

-il©&flnc«fc'K ftl 0 0 0^7r-^o-^#/; 0 
ftl 0 0!ff©7T-y*o->£#j$l::ftt>;*;B§®C 6 0 0 h f 1 A#{:®& 
7*U- h±©NZY*^**±Tl»«Ufc. t^o>gi:7 7 -^H 
?L»itl/yj *£2fcftSiU Sfc^StfT/U* 'JffilL/: 0 hTPC 5 i 
hTPC 3*7*74 "7- & LTfflC^PCRg$* 32 P$gSLT7°o-7*<»: LTffl 
CCD 7° D- 7*£'n4 7* 'J y^Xt £7 r -V - 9 a- S^Zf V 

^*0JRU B*tt2fST*a-WfcLfctfc. if ASftfc c D N A$ifr£$ts7 
77; F (p-HBOU pHB04) iin vivo excision feK 
THIRL fc 0 

7*77; KpHBOK pHB04 Ko^T»JRB*inBtffcBI*fl?Sl Lfc t C 
4* 1 . lkbp, 7. 4 k b p© c DN AA<}f ASftTfc 9. E4 
<k74\ «K+5ttfl|MflilC6*C t ofc. f^-.T^IcDNA 
■*ft«U pHBOK pHB0 4<DDNAI3BJ*)MILfc. CODNAEM* 
' «»HHBWIWllliac!ie-»Ta*'fr*9**:tC5, #8. lkbpt:i5/:5^ 
U c©«t»KC3feiM!l©* h * 7* • a K>*a-t?**/i3t--7*> • U -x 
4 >^ • 7 W AtfjaajStifc. Z©^-7"> - 'J-t^>^« 7Hii^f 

(2) h h • f o> DNAO»»-±«E5||<0ttff (5' -RACE 

&) 
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XS(l) T»&hfcDNAEyimEfl*<DE*>IMl 3lc**-DNAEM©9*> 

T5'-Rapid Amplification of cDNA Ends 
(RACE) afcfcJB^Tfctttfc, 
5' -RACEfcli, Clontechtt0 5' -RACE*^$fli^ 7 

-a.Tvnc«t^ofc 0 ia<i) ^fcc^PA-ijfflsa^^^n^po 1 y < A ) 

+ RNA2^gi > EJija©EJIJ#^l 3©fcg|#^l 165-11 8 7#§©$& 
aKttMWftDNAEfll©* 'J ? L/** K7°5 -f t — T L P CM 3 
1 0 pmo 1 JD*Lfc*ic*»(,fc 0 fteftftttcgs?** 

(GIBCO BRLttOSuperScr ipt II),M^ WF% 
WM**MT4 2tTlWa5**fc. EDTA*aiTRl6*»jtS 
7A*-JftlTi»aRNA**»U ^'^D/^-AitB^tJn 
mcDNA^-mSSL^o 3*<c, :<DcDNAOfgi:, 5' -RACEffl7> 
[5' -P ( + ) A N C] 4 pmo I * R N A 'J £ffi^T 

*l=. TLPCM3T*W5^*ft, s^g- «K7>*-DNAE 
H*MnS*lfc**cDNA*IIB±LT, T >*-DN Al=«»»tt* y rf* 

1 0 2 4-1 0 4 6©«^C«ttWttDNAEWO*'J^3»^U^-f- K7^7- 
TLPNB±**l*T, PCRK.fc*DNAJt«*f?,fc. Stew; li 2 0 #© 1 
iWcDNAi#MOpmol0^^-i^ GIBCO BRL 

ttOTaq^j^-^ft^ft<07 ia7 ;H:i«,TPCR^fc, fc 
*U *4M*ttDNAJtff**tf«*:»i: % fiUttT-^^.*, h . 

httTBMH,*:*, 94tT30^ 6 0ttl», 7 2tT2ft<Df -f M,* 
3 SEfciJigtfc. 

PCRi»tpT7B! u eT*C**-K:**a* % ifHD N A©*fASn* 
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DNAEJ>J*WLT(,*fc 0 cn^^D-xon, ft*«Jtf*o->T* 5 
RACE-L4<ig|4 ll*1-ffiiUC#SE-r 5 o fc. E7!l30E?lj** 1 3 

1 4 4- 1 4 6 i:H-7U-AO»tt 3 K>*#£UCo Sites K>®5' «± 
%1 5 7 bpST, *«StlfcDNAO*S^is«^— c fcl5ci3& , 6% g B<0 
mR N A® 5 ' c DN A*#fcpTtgtt*<iBSl^#A ^nfc. 

8££ (E?IJ£cDi2*JS#l©7i /®#^2 1 7-3 4 5*§l=ffi3rS«# 
*y«7>K) &oa*|&ff (GST-p80hom) **ft«*ffi^Tfta 

ftgu*:fie?fttt*iftJK& iT9t«tttfc, 7 7hfD 

£©®*IESgf (6 Hi s-p8 0hom) ±LT*»**, ««Lfcae» 
tfa6**St*-t 5 * 'J ? d - *;l#tft: (E?lJS©iS?IJ#4§- 1«D7;/ gg## 2 1 7 

**\ «a?9*S K^-pGEX2T (Pharmac i a ft) «M M 
mSma iTfflKLfc*, H i nd I I I Wtttt***** U * > 
K- 'J >*-*»AU ISS^^-pGEXHl 2£fl-g|L*: 0 
««mEeoRlT«|||L^ T 4 * -j * * - * (3Hltt ) tflt,T*« 

*™tu miznm&m indii i«it e 5 , h . fDy 

5 - *aa» c D N Am*Vzr? 75KRaPC53 *«R*XB a mH I "C 
ttKU T4*«j>5-* (ft*!*) **t^T***JH»4kLfc. *o». $Uffi 
**Hi nd I I IirTStolCWBlfUT, ^'J7^jA 7 ; K • y**«ftfl&K 
#L "' ; K •^ D >7-4fSSgcDNAO$C7>DNA®f>t- (E*Jg©£?>jS 
tl««itt8 4 8 - 1 0 3 4i:«it*ft3 9 0 bp»HindI I I- 
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B a idH I DN A glfK*) ^mSlt/Co J«± K <* 0 ft * tl fc 

H i n d I I I -¥»5fc*©p G EXH 1 2<^-t?7h - td>7-^1 
6Rc DNA4*DNAKK-t*DNA5^ y-S/ a > • *v h (Sffiift) 

^t*»s** »&nfci^*****-*jii^T*M«5JMi 09 cx$tt) 

«W»H*fM&U jEUVfi*3M.77a i K*fiWUTt*« pGEXp 8 0 h om 
/J Ml 0 9*a«Lfc. 

pGEXp80hom/JMl 09*, 7 > k° •> D > *^tf 5 0m 1 ©LB§ 
ttCft&U 3 7^T-BftSt9^SL/Cc SBCftfc^CiSifeTl Oft#RU 
7*CTlNFlBI*«bfc*x IPTG*«»fi*0. 3 mMCtt * «fc ? K 
IDA, SDS-PAGEWfift44 kDa®GST-p8 0hom*?S5J3 
GST-p80h om*!65l2Ftffcfc*3k;t.*IIBlJF r a n g o n i © 
(Anal. Biochem. . 210. 179. 1993) KftoT. M 

h h >X- 1 0 0 £flD;trt:&. ^I** f-* > • -fr7 r d-* • t'-X 
(Pharmacia tt) £Jbnx.T©i§ Lfco 4^CT4 0 jMBS L tt A< «i ft& L 
fc«, K-X*196 h5-f h>X - 1 0 0£*tr»J (PBS) 
bT*7Ai:^lfc. h"-Xi:»^tfcG S T- p 8 0hom£2 5 mMMii 
&.?fr9**'s]iV0. 1 % h 7-f h >X- 1 0 0£#frH e p e s 

AStiCli* 1 OOml^Oiim^bO. 7mgOGST- 
p 8 0 h om;&<ft<s>ftrt:<, GST-p80h om* h o > t'^&g-f <5 £ <tK«t 
0. Sfi^g&Hlis SDS-PAGEICfctf SJfc*tf0a^fi*<*(j2 9 kDa© 
GST£#jl6kDa©7vh- x o / 7 --tfgSgBrtt (E7!l&®e?iJ#-? 1 
IZmtvv h •fD < /7-t?I6fi:fcK7; y@S-t2 17-3 4 5i:*B£ 
-r-SSP^-) 0 2oi:«JBr*ti*i. HttPVDFiK|S^fcftiLtN3fJ*<07 

*«BLfc. #ltft2. 6 k g©B#£3fc&ii*-7-*2:jj (R l&tfR 2) 
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feCttoTlElKo* 1 0 0/zg©GST-p80hom£7D-<>f. . T y a 

5X5 KRiPCS3«MliIHind I I I &VB a mH I ?TOU 5, 
hf D >7-tTOcDNA©«f,3 9 0 bp©HindI I I-BamHI 
ODNA0f)t (EllSomf 1 Tflflft 6 4 8- 1 0 3 4 CffSti) « 
JtflU COgff^^p B 1 u e S c r i p t (gftft) ©Hindi I I — 
BamHiattlWD-^n/c, MHffXX h o I XtfN o t 

h>65» h •fo;7-*SejJcDNAO«BH64 8 
- 1 0 3 4 CEnS®E***l) «=*3T* DN AKfr**tfX h o I - 
No t I DNAK>f**au MlifSi 1 IRUNo t I T^JBr L fcj»r 
7Xi KK^-pProEX-1 (Gi bco BRLtt) iDNA5^y- 

K*«*bTl*SpP r oEXp 8 Oh om/JMl 0 9 ZMIRLtZo 

pProEXp80hom/JMl 0 9 *7 > •> y 5 0 m J ©L B 

**C*«U 3 7T:r-l!feiS±-ji*§§L* 0 2BCO*8tt*Ht;*tt T i 0 
***U *6C8 71CTlW*«LfcJt % IPTG*«»a«imMCtt* 
**CjD*T, SDS-PAGEWfi»18kD«»8Hi s-p8 0hom 
SISSIS-**:, 6 H i s - p 8 0 h om*jm$*fcfi**;l*Mr** 
Q i a g e ntt©^ h ^W:t,-C6M/7-^!g|^^ M| . 
»*U Ni-NTA-7#D-* (Qi agentt) VMttLtz. t'-XZft 
frLtz'te^ mgltzQ Hi s-p8 0hom*6MJHR*£frpH4. 3©Tr i s 
/y>»«WT«lBL«:. «gi*n/c6Hi s-pSOhommi^t 
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tZo ±1®KT7<< f)\>\ 0 (Biora dtt) *!$®£-ti\ 6Hi s-p80hom 
^^d^'J Lt7 7^-f^ • t"-XittmiK 0 &m®Kli, 1 0 0m 1 
^OpProEXp80hom/JMl 09 <D%%W#.fr h 0 . 7 m g CD 
©6Hi s-p80h om^in^ *©9 5%£U:^7 7^ - ;H OK? a* 
'J >9£titZo 

TAntibodyJ (Ed Ha r I owt»l, Cold Spring 
Harbor Laboratory Press) ICfQ8fc©2rfeKt£oTx GST 
-p 8 OT&BESftfcR 1 © 7&RgS©iaft£jflirt2m li^il75<ig (R 1 
-41 d) x R 2<D7i!ia@0i§fc&JfiLft2m 1 a>£ 8 6 ju g (R 2 - 4 1 d) 

7 5 ^^#^2 1 7 - 3 4 5K«*8*«tt£) KOREAS* * C Sx*? 
y -7vy hfcT*BLfc„ 

JIJMH *6*»«3T»&ftfc7 » h*fclik h&#®ro^ 7 --tf3£ff 
c DNA*<, &SSK7 »Hfcttkh fo>7-t?Ifig*3- KLTl** C 
^■BIT©*? KfEWLfc. SW«4Tj|4h/;a**A7» h • 

t- u / 5 - * S&tffc&itfS $ ft & * £ 9 L fz a 

If, Rl ®ftfffTJllLftA*&7 r o7-4' >A*770-X (Pha rmac i a 
tt) *Jl^T*&I'gG*ttfflL (P I - 1) „ CO I g G £ R 1 OflftftjflLft* 
*»fcftfc«r»I gG, Rl - 4 1 d (jU9ENte«7flttjflinid«) StfR 1 - 
1 1 6 d (ftttMtilttl 6&&lfil*&&&) ©3 05*0 I 8 GW/of-f >A 

t7 7 n-xi:3- h l/c. t hi»4*pa- imimv? -j Ufa* 

3ft A H 6 6 FttIS*** Counter (EMB 0 J . , 1 1 , 

1 9 2 1 , 1 9 9 2) irftot S 1 0 0 tttB$££H!S Lfc 0 cottitB«i:«$ft 
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©1KCHAPS/1 xHjrpottW (Counter^ ±»**) ^ 
^ t150 " 5 go I «G*3-HfcJ:B^DfOA-b7 7 D- 
4 1C-C1. 5*M&L*:. 0. 5«CHAPS/ 

*oy*-*©Stt tt% Tate D atsu«(Oncogene, 13, 
2 2 6 5 - 2 2 7 4. 1 9 9 6) ^+^>«« d UTP4tt^ 
+ S>*i>tt**JflH*:EL I SA*T«Hetfc. fc«u fD>5-* K Ji 5 

bpTG3(Biot inylated 5 ' - 

GTAAAACGACGGCCAGTTTGGGCTTGGGGTTGGGGTTG 
-3* ) &m\ &*0. 8mMOtym->7;wF (TTP, dATP, 
dGTP) *S»iLT. 3 0TTl*«fifcs* fco |f SM)CEDTA 
z. 5 c <t ir J: $ jfc $ ^ o 

EDC (Si gmattS) ££H>T* h 1/7 hrfy> (G I BCO 

»l^.3 7tT2Wr D ,*>yu i ±EO**x,U=, TBSTfRU 
TD>7-l?flrfifieSft2 5 A 1*1DAT, 3 7TCT3 0^«fiLT^U-h 

******»W>:*, 20mMTr is-HCl (pH8. 3) , 75mM 
KCU 0. 0 0 596W-1, 1. 5mM MgCl 2% 4*M<3bpTG3. 
loMOtU.j.n^FT-^T-, pTAG^v (5 ' - 

CAGGAAACAGCTATGACCCCTAACCCTAACCCTAACCCT 
-3' ),^50^MOdATP, dCTP, dGTP, 2 5*M©TTP, 
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(G I BCO BRLitS) *#fcPCR£0ft£i O ;L % * * 7 • 

PCR*-v*-!M*7-*/BOTPCRJ*tt*fir>fc( 9 3"CT3 0», 6 9 °C 
"C3 OfK 7 21CT1^#T3 4 . 

5 OmM^gff- h 'J (pH9. 6) 4ffi^5mg/m l CHgjL 

£1 0 1 /❖xyl/Oft-frT^tta, 3 71CT1 W»»LTX Y\,7YT\l 
*>*3-H,fc. XM/yhTfy^^HT^ ro» + >^JW*l 5 0 
/i l/frx^oate-cattU 3 7tT2^|gl7'D y + yyU; 4 :o^ xW: , 
TBST2 0«K#«?LfcPCRM*l 0 0^1 /-> x ^roftlA, 3 7°CT 
3 OMiSLt^U- Mitt^S-tffc. 5 0a I /->x;u 

©0. 05«Tween 2 0/TB ST5@^U;^ TBST5 0 0 0M 

fAUj£rt) ^iflx.T3 7°CT3 0#«&L£ o 7U- h£l 5 0 /u 1 /-> x ^ 
00. 05«Tween20/TBST5TOL^ 0. lM^i^-yb 
75>fi««(pH9. 5) Tl 00fS*«LfcCSPD (D i sod i urn 3 
-(4-methoxyspiroCl. 2-d i ox e t ane-3, 2 - (5 
-chloro) tricyclo (3. 3. 1. 13. 7) decan) - 4 
-yl) phenyl phosphate) (T r o p i x J±§?) £flnx., M 
iS"C 3 O^fHKfc^SSifcS-tfT^ I S / —9— (Kj[,y-)\,y. y + 

*i gG (Pl-i) £=j- h Lfcb--Xi:iifD> 5— tf»tt*<»^Ba!)in 
5— tfjgtSEi<B»6ftfc. 
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5 0 OTJ*!®? 7 hJrf&S&AH 6 6 F8BJ&£^ ->JI6!£lla?t 
(dFCS) 1 0 %&cftf / >l"< v a^ftMEM (DMEM) T 

Tffll^O. 596CHAP S/ 1 x H y p o«SS?« ( fCttUi L-fc. ")t*R10 
*«ilir]fiL»** l gG, * fc 3 y h • 5~tfaaK«frKJ:* 

»iaSS:iwffi^-rsaai^lwiD^T4^CT2^K«gLfco Laemml i 

CD S D S gLttMSflS^iO*. T ifli!l ■ £tt U 696SDS-PAGE "Cg^ L fc. 
y;l>*ft«-CB£Lfcfc. ENHANCE (NENttS!) ML. flE»«l=7^ 
-f — left U^o tofel, I gGMLfc*>^Hw 

CD*> &3 0 0 k Da©fl*tt'<> K^ffli^ti/:, 

HJfc0J7 : t h • fn>7-l?MmRNAOt hffiloi&&tfjEtt&lft£ feu- 
's 8% 

C 1 o n t e c h fcfcCDM ultiple Tissue Northern 
Blot &U\ Human Cancer Cell Line Multiple 

Tissue Northern B 1 o t^ffll^t, hh - f D^7-t'I 
SgmRNAOt F«ttBRtfiEttffl»lcfclt5»£*fcttUfc. fa-ft L 
TttSBMH2©XB(l) T®£*i*:t h • xo^ 7~t?g6®i2fs^cDNAB9r}t 

(E5«Ja©EW»2) * 32 p$a$uxffli^ /N-rry^-f 5 0 967 

t*i7i K#ftTC4 2*CT— ft«ff«»fc. *ro» hM£0. 196SDS* 
StflxfttfO. 1 x S S P EJMSttTiSlsfrL-fcfcK, t-l«7-^/77<- 

mmu <tm> ipjk. *j§> 4>hu mu # 
&tf»tt*©t hjE#&iafc&©p o 1 y (A) + RUA\z\*ymtt 1 o. 

7 k b©><> K#fcUJ*tifco *fc» h HBfi*»IS<*43!E©P o 1 y (A) + 
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3 x 1 0 9 ®<D7 v hfFffi&$MttAH6 6 FHfifrSCou nteriO 
(EMBO. J , 1 1 , 1 9 2 1 , 1 9 9 5) KflgoTS 1 0 0«jffi££fS 
SL/Co TMG*8a?£ (1 OmM Tris-SI pH8. 0, ImM 

7 * '> Q ^>n ImM ^f-t^H' h-;^ l 0 -tr o-;P) "CIS 

ftLfc^/N-'J >-tr77D-xCL-6B*77A (: r ^y7tt) Sft 

*^**tfTMG»ffi»refi&?DLfc'W Kn*>/7/<*-f ha^A 5 -y 

Ftt) Cil> 5mM KP&ffim (0. 25mM 'J > & ~ ?KJf — * y 7 a , 
4. 7 5mM y >»— **Zl*y ->A, 5 OmM *ffc*y*A % ImM 

ImM yftxi/'f 1 0%/'JtD-^) TjSfc^ 

0. 5M KP®S5£C2 5mM 'J >® ~tK5I — ft 'J 9 A „ 4 7 5mM 
7 A, 5 OmM aMfc*y*A, ImM aMfcvr*->*A % 
ImM U-f 1 0K/yHrn-AO £JBt*fcJRPt*tfJ**Tofc, 

fD>7-^gttmSI^», 5 OmM ig-fb^ y 9A££frTMG 

T\ f-ajl 7-Hfffitt*Wr*B^**A, 0. 5M gft* 'J -7 A £ 1 mM 

(Zn 2+ ) *b- h77^-f^7A (WfiS'W h?,^ + >*\ 

•tfStt^Wl-S^liJliid* 1 5 - 4 0 96^y-feo-;Ugft43Eai|>£jBI y ^ 
7> ttSW2 8 o-*- % 2 5 0 0 0l2]S> 2t, 2 4^) icftLfc. *©& 
fay 5--tfStt±«H©**afiK£ LT4 4 S ©ttfcfcttfcjjrr *>©# 
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n*ti, *«D#-?Sl**J 1 5 0 0 k D a <fcff-*3nfc. 

* 6 1: * /Hd- /l/fi«4gEa'l>£»?& U fc«-H»* 696SDS-PAGE 
T#IBU HififlH T«tf#Lfc&&;L 77^70/7 -IflfilCjItt 
UtttT* x >7n 7 K£fr-3 ttt -r o ,/ -5 — t* Sfcfe'&^'t'SSff 
IC<4 3o<Dtt#fil6tt<D^<> K (SDS-PAGE±©#?fi(i&2 4 0 kDa, 
2 3 OkDa> 55kDa) ft fc 0 :©?55 5 kDaOA> Ktiffifl 

91101 C£ 9 2 4 0 kDa*fc(i23 OkDaflISfC^18ItT*5:i* 
5S®L^o C©&£J;>K 5? h • xd^ 5— tffc&ffKli 2 4 0 kDa©§6 

V car r P 2 4 oj i^tsci uT*ffi*nfc 

2 3 0 kDaOiaf (J«T Tp230J tttSCiUi) ^iLti 

p24otp23 ov&jRmmzm^ztz*), mis©'*** • fx^^M 

ffofc, 1 0 cm^^f^f-f -y ->a{r^t>^v 7 hBT&ft5fc*fflJ!&!*AH 6 6F 
i!S^2 5 0 ^Ci/m!C [ 35 S] (ffio a o£Tran 35S- 

label, I CNtt) £ 1 0 96^l6iSjfil?t (JRH ><-f x >xtfc) £ 

-fir lm 1 ODMEMS* (/ft->, y^f-f>^, ^Or-^fJi 

»£*ifc$USeMltt* 6 96 SDS-PAGE % &1%T-*- ^7-^/77^- 

p 2 4 OTfco/i. L*U MM (In 3 n 619110) i:p 2 4 0I**^U 
p 2 3 0*<i#iDLfc. :©:i^<i % 77 h - 7-a / liUtt 
p 2 4 0 £^tit8fi2©geg£ LTRSSft, *©&{gfi5£gtfTp 2 3 0*£ 
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vv hJEftj&mRU? v hflf&S5fc*fflS&tfcAH6 6 FSBfi&+<Dp 2 4 0/p23 0 

©^Sit-ts x 5--tf«tti©«H*n-<fc. *r, 5-y Ufa. ffflu « 

M^AH66F«ia^^Count e r^©^S (EMBO. J, 11, 
1 9 2 1 , 1 9 9 5) KfcoTS 1 0 0Jfi£tt*U£U cn«*tt0j8£ffl* 
ics/<y >t7 7 D-xCL-6B*7ACT«*»8Lfc, #fny7-^ 
K -at* TBI**. 5 v h • xn> 7--tfgfi^#^UT;^^ffii%/-o iX 
9>7a- y h^CkSp 2 3 0/p 2 4 Offttit, fc«ktf x r o > 5 —tfjgtt© 

3fflfl§, ffiiJL ff«T**K »a-C«HttJ*fttt*v,fc 0 p 2 3 OO^fiJt 

tt'2H»JHK AH 6 6 F$fflJ& % fflAL fFBT*ofc. ffBTIi«£/,£p 2 3 0 fi 

C©&jg<jp 2 3 0 O??aEJttrD> 
P 2 3 0#SttS, p 2 4 0(i^SiiSO7.y h • r a > 5 —HS&KimOL-t 

h©p 2 4 OTisssftfc^it LT^fiK^nx *o»fi*smp 240 

IfiJCtrii, H^JSttJH*J:^Sttja-CttSDS-*'J7^ 'J*7 5 K 
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seyijs-t: 1 

WMO&Z ■ = 8215, 7; y® = 2 6 2 9 

gE^JoeS : c DN A 

: 

AGCTCCGCCC CTCCCCTTGC CCAGCCTCGC CCCTTCGCCT CTCTAGGGTG TTGGTTTCCT 60 
TTCAGTTCTC TTTCTTCAAC CTATCCACTG GCTGACCTAG GCCGGTTTCT GCTCCTTGTT 120 
GCGGAGAACC AACGCGCCCC TCACTGTGCA CAGCTTTTCC AGTCCCGAGC GCAGGCACAT 180 
AGAGATTGTG CTGCCGCT ATG GAG AAA CTC TGT GGT TAT GTG CCT GTC 228 
Met Glu Lys Leu Cys Gly Tyr Val Pro Val 
1 5 10 

CAC CCA GAC ATC CTC TCC TTG AAG AAT CGG TGC CTG ACC ATG CTC TCT 276 
His Pro Asp lie Leu Ser Leu Lys Asn Arg Cys Leu Thr Het Leu Ser 

15 20 25 

GAC ATC CAA CCC CTG GAG AAA ATA CAT GGA CAG AGA TCT GTC AAC CCA 324 
Asp He Gin Pro Leu Glu Lys lie His Gly Gin Arg Ser Yal Asn Pro 

30 35 40 

GAC ATC CTG TCC TTG GAG AAC CGG TGC CTG ACC TTG CTC CCT GAT CTC 372 
Asp He Leu Ser Leu Glu Asn Arg Cys Leu Thr Leu Leu Pro Asp Leu 

45 50 55 

CAG CCC ATG GAG AAA ATA CAT GGA CAG AGA TCT GTC CAC CCA GAC ATC 420 
Gin Pro Het Glu Lys He His Gly Gin Arg Ser Val His Pro Asp He 

60 65 70 

CTC TCC TCA GAG AAC CGG TGT CTG ACC TTG CTC CCT GAC CTC CAG TCC 468 
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Leu Ser Ser Glu Asn Arg Cys Leu Thr Leu Leu Pro Asp Leu Gin Ser 
75 80 85 90 

CTG GAG A AG CTA TGT GGA CAT ATG TCT ACT CAC CCA GAC GTC CTC TCT 516 
Leu Glu Lys Leu Cys Gly His Met Ser Ser His Pro Asp Val Leu Ser 

95 100 105 

TTG GAG AAC CGA TGT CTT GCT ACC CTC CCG ACT GTA AAG AGA ACT GTT 564 
Leu Glu Asn Arg Cys Leu Ala Thr Leu Pro Thr Val Lys Arg Thr Val 

110 115 120 

TCG ACT GGC CCC TTG CTC CAG TGT CTT CAC AGA TCT CAT ACG GCA CAA 612 
Ser Ser Gly Pro Leu Leu Gin Cys Leu His Arg Ser His Thr Ala Gin 

125 130 135 

GCT GAT CTG CGT GAC CCG AAC TTT CGC AAC TGC CTG TTC CCT GAG CCT 660 
Ala Asp Leu Arg Asp Pro Asn Phe Arg Asn Cys Leu Phe Pro Glu Pro 

140 145 150 

CCT ACC ATA GAG GCT CCA TGT TTC TTG AAG GAA CTA GAC CTT CCA ACT 708 
Pro Thr He Glu Ala Pro Cys Phe Leu Lys Glu Leu Asp Leu Pro Thr 
155 160 165 170 

GGA CCC AGG GCC CTG AAA TCC ATG TCT GCT ACA GCT CGA GTT CAG GAA 756 
Gly Pro Arg Ala Leu Lys Ser Met Ser Ala Thr Ala Arg Val Gin Glu 

175 180 185 

GTA GCT TTG GGT CAG CGG TGC GTC TCA GAA GGA AAG GAA TTG CAG GAA 804 
Val Ala Leu Gly Gin Arg Cys Val Ser Glu Gly Lys Glu Leu Gin Glu 

190 195 200 

GAA AAA GAA AGC GCA GAA GTC CCG ATG CCT TTG TAC ACT CTA AGC TTG 852 
Glu Lys Glu Ser Ala Glu Val Pro Het Pro Leu Tyr Ser Leu Ser Leu 

201 210 215 

GGG GGA GAA GAA GAA GAA GTG GTG GGG GCA CCG GTC CTA AAA CTC ACA 900 
Gly Gly Glu Glu Glu Glu Val Val Gly Ala Pro Val Leu Lys Leu Thr 
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220 225 230 

TCT GGA GAC TCT GAC TCT CAC CCT GAA ACC ACT GAC CAG ATC CTG CAG 948 
Ser Gly Asp Ser Asp Ser His Pro Glu Thr Thr Asp Gin He Leu Gin 
235 240 245 250 

GAG AAG AAG ATG GCT CTC TTG ACC TTG CTG TGC TCA GCT ATG GCC TCA 996 
Glu Lys Lys Met Ala Leu Leu Thr Leu Leu Cys Ser Ala Het Ala Ser 

255 260 265 

ACT GTG AAT GTG AAA GAT GCC TCC GAT CCT ACC CGG GCA TCT ATC CAT 1044 
Ser Val Asn Val Lys Asp Ala Ser Asp Pro Thr Arg Ala Ser He His 

270 275 280 

GAA GTC TGC AGT GCG CTG GCC CCC TTG GAA CCT GAG TTC ATC CTT AAG 1092 
Glu Val Cys Ser Ala Leu Ala Pro Leu Glu Pro Glu Phe He Leu Lys 

285 290 295 

GCA TCT TTG TAT GCT AGG CAG CAG CTT AAC CTC CGG GAC ATA GCC AAT 1140 
Ala Ser Leu Tyr Ala Arg Gin Gin Leu Asn Leu Arg Asp He Ala Asn 

300 305 310 

ATA GTG TTG GCC GTG GCT GCC CTC TTG CCA GCC TGC CGC CCC CAT GTA 1188 
lie Val Leu Ala Val Ala Ala Leu Leu Pro Ala Cys Arg Pro His Val 
315 320 325 330 

CGA CGG TAT TAC TCT GCC ATT GTT CAC CTG CCT TCA GAC TGG ATC CAG 1236 
Arg Arg Tyr Tyr Ser Ala He Yal His Leu Pro Ser Asp Trp lie Gin 

335 340 345 

GTA GCC GAG TTC TAC CAG AGC CTG GCA GAA GGG GAT GAG AAG AAG TTG 1284 
Yal Ala Glu Phe Tyr Gin Ser Leu Ala Glu Gly Asp Glu Lys Lys Leu 

350 355 360 

GTG CCC CTG CCT GCC TGC CTC CGT GCT GCC ATG ACT GAC AAA TTT GCC 1332 
Val Pro Leu Pro Ala Cys Leu Arg Ala Ala Set Thr Asp Lys Phe Ala 
365 370 375 
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CAC TTT GAT CAC TAC CAC CTA GCG AAG TAC AAC CCA CGG AAA CAC CGA 1380 
Gin Phe Asp Glu Tyr Gin Leu Ala Lys Tyr Asn Pro Arg Lys His Arg 

380 385 390 

TCC AAG ACA CGT TCC CGC CAG CCA CCC CGC CCT CAA AGG ACA AAA CCT 1428 
Ser Lys Thr Arg Ser Arg Gin Pro Pro Arg Pro Gin Arg Thr Lys Pro 
395 400 405 410 

CCA TTT TCA GAG ACT GGG AAA TGT TTT CCA AAG AGC GTT TGG CCC CTT 1476 
Pro Phe Ser Glu Ser Gly Lys Cys Phe Pro Lys Ser Val Trp Pro Leu 

415 420 425 

AAA AAC GAA CAG ATT TCG TTC GAA GCA GCT TAT AAT GCA GTG TCA GAG 1524 
Lys Asn Glu Gin lie Ser Phe Glu Ala Ala Tyr Asn Ala Val Ser Glu 

430 435 440 

AAG AAA AGG CTA CCA AGG TTC ACT CTG AAG AAG TTG GTA GAG CAA CTG 1572 
Lys Lys Arg Leu Pro Arg Phe Thr Leu Lys Lys Leu Val Glu Gin Leu 

445 450 455 

CAT ATC CAT GAG CCT GCG CAG CAT GTC CAG GCC CTG CTG GGC TAC AGG 1620 
His lie His Glu Pro Ala Gin His Val Gin Ala Leu Leu Gly Tyr Arg 

460 465 470 

TAC CCA TCC ACC CTA GAG CTC TTT TCT CGG AGT CAT CTC CCT GGG CCA 1668 
Tyr Pro Ser Thr Leu Glu Leu Phe Ser Arg Ser His Leu Pro Gly Pro 
475 480 485 490 

TGG GAC TCT AGC AGG GCT GGG CAA CGC ATG AAG CTC CAA AGG CCA GAG 1716 
Trp Asp Ser Ser Arg Ala Gly Gin Arg Bet Lys Leu Gin Arg Pro Glu 

495 500 505 

ACC TGG GAG CGG GAG CTG AGC TTA CGT GGA AAC AGA GCT TCT GTG TGG 1764 
Thr Trp Glu Arg Glu Leu Ser Leu Arg Gly Asn Arg Ala Ser Val Trp 

510 515 520 

GAG GAA CTC ATA GAC AAT GGG AAA CTC CCC TTC ATG GCC ATG CTC CGG 1812 
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Glu Clu Leu He Asp Asn Cly Lys Leu Pro Phe Met Ala Met Leu Arg 

525 530 535 

AAC CTG TGT AAC CTG CTG CGG ACT GGG ATC ACT GCC CAC CAC CAT GAA I860 
Asn Leu Cys Asn Leu Leu Arg Thr Gly lie Ser Ala His His His Glu 

540 545 550 

CTC GTT CTC CAG AGA CTC CAG CAT GAG AAA TCT GTG ATT CAC ACT CGG 1908 
Leu Val Leu Gin Arg Leu Gin His Glu Lys Ser Val He His Ser Arg 
555 560 565 570 

CAG TTT CCA TTC AGA TTC CTT AAT GCT CAC GAC TCT CTC GAT AGA CTC 1956 
Gin Phe Pro Phe Arg Phe Leu Asn Ala His Asp Ser Leu Asp Arg Leu 

575 580 585 

GAG GCT CAG CTC AGA ACT AAA GCA TCG CCC TTC CCT TCC AAT ACA ACA 2004 
Glu Ala Gin Leu Arg Ser Lys Ala Ser Pro Phe Pro Ser Asn Thr Thr 

590 595 600 

TTG ATG AAG CGG ATA ATG ATT AGA AAC TCA AAA AAA ATC AAG AGA CCT 2052 
Leu Met Lys Arg He Het He Arg Asn Ser Lys Lys lie Lys Arg Pro 

505 610 615 

GCC AAC CCG AGG TAC CTG TGC ACC CTG ACG CAG CGG CAG CTT CGG GCG 2100 
Ala Asn Pro Arg Tyr Leu Cys Thr Leu Thr Gin Arg Gin Leu Arg Ala 

620 625 630 

GCA ATG GCT ATC CCG GTG ATG TAT GAG CAT CTC AAG CGG GAG AAA CTG 2148 
Ala Het Ala lie Pro Val Met Tyr Glu His Leu Lys Arg Glu Lys Leu 
635 640 645 650 

AGG CTG CAC AAG GCC AGA CAG TGG ACC TGT GAC CTT GAG TTG CTG GAG 2196 
Arg Leu His Lys Ala Arg Gin Trp Thr Cys Asp Leu Glu Leu Leu Glu 

655 660 665 

CGG TAT CGC CAG GCC CTG GAA ACG GCC GTG AAC ATC TCT GTA AAG CAC 2244 
Arg Tyr Arg Gin Ala Leu Glu Thr Ala Val Asn He Ser Val Lys His 
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670 675 680 

AAC CTA CCC CCC CTG CCA GGC CCA ACC CTC TTG GTC TAT CTC ACA GAT 2292 
Asn Leu Pro Pro Leu Pro Gly Arg Thr Leu Leu Val Tyr Leu Thr Asp 

685 690 695 

GCA AAT GCC AAC AGA CTT TGT CCC AAG ACT CAC TTG CAA GGG CCT CCC 2340 
Ala Asn Ala Asn Arg Leu Cys Pro Lys Ser His Leu Gin Gly Pro Pro 

700 705 710 

CTG AAC TAT GTG CTG CTG TTG ATC GGG ATG ATG ATG GCT CGG GCG GAG 2388 
Leu Asn Tyr Val Leu Leu Leu He Gly JJet Met Het Ala Arg Ala Glu 
715 720 725 730 

CAG ACG ACA GTT TGG CTG TGT GGG ACA GGA ACT GTG AAG ACA CCA GTA 2436 
Gin Thr Thr Val Trp Leu Cys Gly Thr Gly Thr Val Lys Thr Pro Val 

735 740 745 

CTT ACA GCC GAC GAA GGT ATC CTG AAG ACT GCC ATC AAA CTT CAG GCT 2484 
Leu Thr Ala Asp Glu Gly He Leu Lys Thr Ala He Lys Leu Gin Ala 

750 755 760 

CAA GTC CAG GAG TTA GAA GAA AAT GAT CAG TGG CCC CTG GAA ACT TTT 2532 
Gin Val Gin Glu Leu Glu Glu Asn Asp Glu Trp Pro Leu Glu Thr Phe 

765 770 775 

GAG AAG TAC CTG CTA TCT CTG GCT GTG CGA AGG ACC CCT ATT GAC AGG 2580 
Glu Lys Tyr Leu Leu Ser Leu Ala Val Arg Arg Thr Pro He Asp Arg 

780 785 790 

GTC ATC CTG TTC GGC CAA AGG ATG GAT ACG GAG CTG CTG AAT GTA GCC 2628 
Val He Leu Phe Gly Gin Arg Het Asp Thr Glu Leu Leu Asn Val Ala 
795 800 805 810 

AAA CAG ATT ATC TGG CAG CAT GTG AAT TCC AAG TGC CTC TTC GTC ACT 2676 
Lys Gin He He Trp Gin His Val Asn Ser Lys Cys Leu Phe Val Ser 
815 820 825 
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GTC CTC CTA CGG AAA ATG CAG TAC ATG TCA CCA AAT TTG AAT CCC AAT 2724 
Val Leu Leu Arg Lys Met Gin Tyr Met Ser Pro Asn Leu Asn Pro Asn 

830 835 840 

GAT GTG ACG CTC TCG GGC TGC ACT GAC GGG ATC CTG AAG TTC ATT GCG 2772 
Asp Val Thr Leu Ser Gly Cys Thr Asp Gly He Leu Lys Phe He Ala 

845 850 855 

GAG CAT GGA GCC TCT CGT CTT CTG GAA CAT GTG GGC CAA CTA GAT AAG 2820 
Glu His Gly Ala Ser Arg Leu Leu Glu His Val Gly Gin Leu Asp Lys 

860 865 870 

ATA TTC AAG ATC CCT CCA CCC CCA GGA AAG ACA AAG GTC TCA CCT CTC 2868 
He Phe Lys He Pro Pro Pro Pro Gly Lys Thr Lys Val Ser Pro Leu 
875 880 885 890 

CGG CCG CTG GAG GAG AAC AAC CCT GGT CCC TTC GTT CCT ATT TCC CAG 29 IS 
Arg Pro Leu Glu Glu Asn Asn Pro Gly Pro Phe Val Pro He Ser Gin 

895 900 905 

CAT GGA TGG CGC AAC ATC CGG CTT TTC ATT TCG TCC ACT TTC CGA GAC 2964 
His Gly Trp Arg Asn He Arg Leu Phe lie Ser Ser Thr Phe Arg Asp 

910 915 920 

ATG CAT GGG GAA CGA GAC TTG CTG ATG CCA TCT CTT CTG CCA GCG CTG 3012 
Met His Gly Glu Arg Asp Leu Leu Met Arg Ser Val Leu Pro Ala Leu 

925 930 935 

CAG GCC CGA GCG TTC CCC CAC CGC ATC AGC CTT CAC GCC ATT GAC CTG 3060 
Gin Ala Arg Ala Phe Pro His Arg He Ser Leu His Ala He Asp Leu 

940 945 950 

CGC TGG GGA ATC ACG GAG GAA GAG ACC CGC AGG AAC AGA CAA CTG GAA 3108 
Arg Trp Gly He Thr Glu Glu Glu Thr Arg Arg Asn Arg Gin Leu Glu 
955 960 965 970 

GTG TGC CTT GGG GAG GTG GAG AAC TCT CAG CTG TTC GTG GGG ATC CTG 3156 
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Val Cys Leu Gly Glu Val Glu Asn Ser Gin Leu Phe Val Gly Il e Leu 

975 980 985 

GGC TCC CGC TAT GGC TAT ACT CCC CCC AGC TAT GAT CTG CCT GAC CAC 3204 
Gly Ser Arg Tyr Gly Tyr Thr Pro Pro Ser Tyr Asp Leu Pro Asp His 

"0 995 iooo 

CCC CAC TTT CAC TGG ACC CAG CGA TAC CCT TCG GGG CGC TCT GTA ACA 3252 
Pro His Phe His Trp Thr Gin Arg Tyr Pro Ser Gly Arg Ser Val Thr 

1005 1010 1015 

GAG ATG GAG GTG ATG CAG TTC CTG AAC CCT GGC CAA CGC TCG GAA CCC 3300 
Glu Het Glu' Val Met Gin Phe Leu Asn Arg Gly Gin Arg Ser Glu Pro 

1020 1025 1030 

TCT GAC CAA GCT CTC ATC TAC TTC CGA GAT CCT GGT TTC CTT AGC TCT 3348 
Ser Asp Gin Ala Leu Ile Tyr Phe Arg Asp Pro Gly Phe Leu Ser Ser 
1035 1040 1045 1050 

GTG CCA GAT GTC TGG AAA CCT GAC TTT ATT TCC GAG TCA GAA GAG GCT 3396 
Val Pro Asp Val Trp Lys Pro Asp Phe Ile Ser Glu Ser Glu Glu Ala 

1055 1060 1065 

GCA CAT CGG GTC TCA GAA CTG AAG AGA TTC CTA CAG GAA CAG AAA GAG 3444 
Ala His Arg Val Ser Glu Leu Lys Arg Phe Leu Gin Glu Gin Lys Glu 

1070 1075 1080 

GTT ACC TGC CGC AGG TAC TCC TGT CAA TGG GGA GGC GTA GCA GCC GGC 3492 
Val Thr Cys Arg Arg Tyr Ser Cys Glu Trp Gly Gly Val Ala Ala Gly 

1085 1090 1095 

CGG CCC TAT ACT GGG GGC CTG GAG GAG TTT GGA CAG TTG GTT CTC CAA 3540 
Arg Pro Tyr Thr Gly Gly Leu Glu Glu Phe Gly Gin Leu Val Leu Gin 

1100 1105 mo 

GAT GTG TGG AGC GTG ATC CAG AAG CGT TAC CTG CAG CCT GGG GCC CAG 3588 
Asp Val Trp Ser Val Ile Gin Lys Arg Tyr Leu Gin Pro Gly Ala Gin 
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L1I5 1120 1125 1130 

TTG GAG CAG CCA GGA TCC ATC TCA GAA GAG GAT TTG ATC CAG GCC AGC 3636 

Leu Glu Gin Pro Gly Ser lie Ser Glu Glu Asp Leu He Gin Ala Ser 

1135 1140 1145 

TTT CAG CAG CTG AAG AGC CCA CCG ACT CCC GCA CGG CCA CGC CTT CTT 3684 
Phe Gin Gin Leu Lys Ser Pro Pro Ser Pro Ala Arg Pro Arg Leu Leu 

1150 1155 1160 

CAG GAT ACC GTG CAA CAG CTG ATG CTG CCC CAC GGG AGG CTG AGC CTA 3732 
Gin Asp Thr Val Gin Gin Leu Het Leu Pro His Gly Arg Leu Ser Leu 

1165 1170 1175 

GTG ATT GGG CAG GCA GGA CAG GGA AAG ACT GCC TTC CTG GCA TCC CTT 3780 
Val He Gly Gin Ala Gly Gin Gly Lys Thr Ala Phe Leu Ala Ser Leu 

1180 1185 1190 

GTG TCG GCC CTG AAG GTT CCC GAC CAG CCC AAT GTG GCC CCG TTC GTT 3828 
Val Ser Ala Leu Lys Val Pro Asp Gin Pro Asn Val Ala Pro Phe Val 
1195 1200 1205 1210 

TTC TTC CAC TTT TCA GCA GCC CGC CCT GAC CAG TGT CTT GCT TTC AAC 3876 
Phe Phe His Phe Ser Ala Ala Arg Pro Asp Gin Cys Leu Ala Phe Asn 

1215 1220 1225. 

CTC CTC AGA CGC CTC TGT ACC CAT CTG CAT CAA AAA CTG GGA GAG CCG 3924 
Leu Leu Arg Arg Leu Cys Thr His Leu His Gin Lys Leu Gly Glu Pro 

1230 1235 1240 

AGC GCT CTC CCC AGC ACT TAC AGA GGC CTG GTG TGG GAA CTG CAG CAG 3972 
Ser Ala Leu Pro Ser Thr Tyr Arg Gly Leu Val Trp Glu Leu Gin Gin 

1245 1250 1255 

AAG CTG CTC CTC AAA TCT GCC CAG TGG CTG CAA CCA GGC CAG ACT TTG 4020 
Lys Leu Leu Leu Lys Ser Ala Gin Trp Leu Gin Pro Gly Gin Thr Leu 
1260 1265 1270 
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GTC CTT ATT. ATC CAC GGC GCA GAT AAG TTG GTG GAC CAT AAT GGA CAG 4068 
Val Leu He He Asp Gly Ala Asp Lys Leu Val Asp His Asn Gly Gin 
1275 1280 1285 1290 

CTG ATT TCA GAC TGG ATC CCC AAG TCT CTT CCG CGG CGA GTA CAC CTG 4116 
Leu He Ser Asp Trp He Pro Lys Ser Leu Pro Arg Arg Val His Leu 

1295 1300 1305 

GTG CTG ACT GTG TCT ACT GAC TCA GGC CTG GGA GAG ACC CTT CAG CAA 4164 
Val Leu Ser Val Ser Ser Asp Ser Gly Leu Gly Glu Thr Leu Gin Gin 

1310 1315 1320 

ACT CAG ACT GCT TAT GTG GTG GCC TTG GGG TCT TTG GTC CCG TCT TCA 4212 
Ser Gin Ser Ala Tyr Val Val Ala Leu Gly Ser Leu Val Pro Ser Ser 

1325 1330 1335 

AGG GCT CAG CTT GTG AGA GAA GAG CTA GCA CTG TAT GGG AAA CGG CTG 4260 
Arg Ala Gin Leu Val Arg Glu Glu Leu Ala Leu Tyr Gly Lys Arg Leu 

1340 1345 1350 

GAG GAG TCA CCT TTT AAC AAC CAG ATG CGG CTG CTG CTG GCA AAG CAG 4308 
Glu Glu Ser Pro Phe Asn Asn Gin Met Arg Leu Leu Leu Ala Lys Gin 
1355 1360 1365 1370 

GGG TCA AGC CTG CCA CTG TAC CTG CAC CTC GTC ACT GAC TAC CTG AGG 4356 
Gly Ser Ser Leu Pro Leu Tyr Leu His Leu Val Thr Asp Tyr Leu Arg 

1375 1380 1385 

CTT TTC ACA CTG TAC GAA CAG GTG TCT GAG AGA CTT CGA ACC CTG CCC 4404 
Leu Phe Thr Leu Tyr Glu Gin Val Ser Glu Arg Leu Arg Thr Leu Pro 

1390 1395 1400 

GCC ACT CTC CCA CTG CTG CTG CAG CAC ATC CTG AGC ACC TTG GAG CAA 4452 
Ala Thr Leu Pro Leu Leu Leu Gin His lie Leu Ser Thr Leu Glu Gin 

1405 1410 1415 

GAG CAT GGC CAT AAC GTC CTT CCT CAA GCT TTG ACT GCC CTT GAG GTC 4500 
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Glu His Gly His Asn Val Leu Pro Gin Ala Leu Thr Ala Leu Glu Val 

1420 1425 1430 

ACG CAC ACT GGT CTG ACT GTG GAC CAG CTG CAT GCA GTC CTG AGC ACG 4548 
Thr His Ser Gly Leu Thr Val Asp Gin Leu His Ala Val Leu Ser Thr 
1435 1440 1445 1450 

TGG TTG ACT TTG CCC AAG GAG ACT AAG AGC TGG GAA GAG GCA GTG GCT 4596 
Trp Leu Thr Leu Pro Lys Glu Thr Lys Ser Trp Glu Glu Ala Val Ala 

1455 1460 U65 

GCC ACT CAC ACT GGA AAC CTC TAC CCC TTG GCT CCA TTT GCC TAC CTT 4644 
Ala Ser His Ser Gly Asn Leu Tyr Pro Leu Ala Pro Phe Ala Tyr Leu 

1470 1475 148O 

GTC CAG ACT CTA CGC ACT TTA CTA GGC GAG GGC CCC GTC GAG CGC CCT 4692 
Val Gin Ser Leu Arg Ser Leu Leu Gly Glu Gly Pro Val Glu Arg Pro 

1485 1490 1495 

GGC GCC CGT CTC TGC CTC TCT GAT GGG CCT CTG AGG ACA GCA GTT AAA 4740 
Gly Ala Arg Leu Cys Leu Ser Asp Gly Pro Leu Arg Thr Ala Val Lys 

1500 1505 1510 

CGT CGC TAT CGG AAA AGG CTG GGG CTA GAG AAG ACT GCG CAT GTC CTC 4788 
Arg Arg Tyr Gly Lys Arg Leu Gly Leu Glu Lys Thr Ala His Val Leu 
1515 1520 1525 1530 

ATT GCA GCT CAC CTC TGG AAG ATG TGT GAC CCT GAT GCC TCA GGC ACC 4836 
He Ala Ala His Leu Trp Lys Met Cys Asp Pro Asp Ala Ser Gly Thr 

1535 1540 1545 

TTC CGA ACT TGC CCT CCC GAG GCT CTG AAA GAT TTA CCT TAC CAC CTG 4884 
Phe Arg Ser Cys Pro Pro Glu Ala Leu Lys Asp Leu Pro Tyr His Leu 

1550 1555 1560 

CTC CAG AGC GGG AAC CAT GGT CTC CTT GCA AAG TTC CTT ACC AAC CTC 4932 
Leu Gin Ser Gly Asn His Gly Leu Leu Ala Lys Phe Leu Thr Asn Leu 
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1565 1570 1575 

CAT GTG GTG GCT GCA TAT CTG GAA GTG GGT CTA GTC CCG GAC CTC TTG 4980 
His Val Val Ala Ala Tyr Leu Glu Val Gly Leu Val Pro Asp Leu Leu 

1580 1585 1590 

GAG GCT TAC GAG CTC TAT GCT TCT TCA AAG CCT GAA GTG AAC CAG AAG 5028 
Glu Ala Tyr Glu Leu Tyr Ala Ser Ser Lys Pro Glu Val Asn Gin Lys 
1595 1600 1605 1610 

CTC CCG GAG GCA GAT GTT GCT GTA TTC CAC AAC TTC CTG AAA CAA CAG 5076 
Leu Pro Glu Ala Asp Val Ala Val Phe His Asn Phe Leu Lys Gin Gin 

1615 1620 1625 

GCT TCA CTC CTT ACC CAG TAT CCT TTG CTC CTG CTC CAG CAG GCA GCT 5124 
Ala Ser Leu Leu Thr Gin Tyr Pro Leu Leu Leu Leu Gin Gin Ala Ala 

1630 1635 1640 

AGC CAG CCT GAA GAG TCA CCT GTT TGC TGC CAG GCC CCC CTG CTC ACC 5172 
Ser Gin Pro Glu Glu Ser Pro Val Cys Cys Gin Ala Pro Leu Leu Thr 

1645 1650 1655 

CAG CGG TGG CAC AAC CAG TGC ATA CTG AAA TGC ATT AAT AAA CCC CAC 5220 
Gin Arg Trp His Asn Gin Cys lie Leu Lys Trp He Asn Lys Pro Gin 

1660 1665 1670 

ACC TTG AAG GGT CAG CAA AGC TTG TCT CTG CCA ATT TCC TCA TCC CCA 5268 
Thr Leu Lys Gly Gin Gin Ser Leu Ser Leu Pro He Ser Ser Ser Pro 
1675 1680 1685 1690 

ACT GCT GTG GCC TTC TCT CCT AAT GGG CAA AGA GCA GCT GTG GGG ACT 5316 
Thr Ala Val Ala Phe Ser Pro Asn Gly Gin Arg Ala Ala Val Gly Thr 

1695 1700 1705 

GCT GGT GGG ACA ATT TAC CTG TTG AAC TTG AGA ACC TGG CAG GAG GAG 5364 
Ala Gly Gly Thr He Tyr Leu Leu Asn Leu Arg Thr Trp Gin Glu Glu 
1710 1715 1720 
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AAG CCT CTC GTG AGT GGC TGT GAT GGG ATT TCC TCT TTC GCG TTC CTG 5412 
Lys Ala Leu Val Ser Gly Cys Asp Gly He Ser Ser Phe Ala Phe Leu 

1725 1730 1735 

TCA GAC ACT GCT CTT TTC CTT ACC ACC TTC GAT GGG CTC CTG GAG CTT 5460 
Ser Asp Thr Ala Leu Phe Leu Thr Thr Phe Asp Gly Leu Leu Glu Leu 

1740 1745 1750 

TGG GAC CTG CAA CAT GGT TGT TGG GTG TTC CAG ACC AAG GCC CAC CAG 5508 
Trp Asp Leu Gin His Gly Cys Trp Val Phe Gin Thr Lys Ala His Gin 
1755 1760 1765 n 70 

TAC CAA ATC ACT GGC TGC TGC CTG ACC CCA GAC CGC CGC CTG CTG GCC 5556 
Tyr Gin He Thr Gly Cys Cys Leu Ser Pro Asp Arg Arg Leu Leu Ala 

177 5 1780 1785 

ACC GTG TGT TTG GGA GGA TAC GTA AAG CTG TGG GAC ACA GTC CAG GGC 5604 
Thr Val Cys Leu Gly Gly Tyr Val Lys Leu Trp Asp Thr Val Gin Gly 

1790 1795 1800 

CAG CTG GCT TTC CAC TAC ACC CAT CCC AAG TCT CTA AAC TGC ATC ACC 5652 
Gin Leu Ala Phe Gin Tyr Thr His Pro Lys Ser Leu Asn Cys lie Thr 

1805 1810 1815 

TTC CAC CCA GAG GGG CAG GTG GTA GCC ACA GGC AAC TGG TCT GGC ATC 5700 
Phe His Pro Glu Gly Gin Val Val Ala Thr Gly Asn Trp Ser Gly He 

1820 1825 1830 

GTG ACC TTC TTC CAG GCA GAT GGA CTC AAA GTC ACC AAG GAA CTA GGG 5748 
Val Thr Phe Phe Gin Ala Asp Gly Leu Lys Val Thr Lys Glu Leu Gly 
1835 1840 1845 1850 

GGC CCA GGA CCC TCT GTT CGT ACG CTG GCA TTC AGT GCA CCC GGG AAG 5796 
Gly Pro Gly Pro Ser Val Arg Thr Leu Ala Phe Ser Ala Pro Gly Lys 

1855 I860 1865 

GTT GTG GCT CTA GGC CGG ATA GAT GGG ACA GTG GAG CTG TGG GCC TGG 5844 
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Val Val Ala Leu Cly Arg lie Asp Gly Thr Val Clu Leu Trp Ala Trp 

1870 1875 i 88 o 

CAA GAG CGC ACA CGG CTG GCA GCC TTC CCT GCA CAG TGT GGC GGT GTC 5892 
Gin Glu Gly Thr Arg Leu Ala Ala Phe Pro Ala Gin Cys Gly Cly Val 

1885 1890 1895 

TCC ACC GTT CTT TTC TTG CAT GCT GGA GGC CGG TTC CTG ACG GCT GGG 5940 
Ser Thr Val Leu Phe Leu His Ala Gly Gly Arg Phe Leu Thr Ala Gly 

1900 1905 1910 

GAA GAT GGC AAG CCT CAG TTA TGG TCA GGA TTT CTT GGC CGG CCC ACC 5988 
Clu Asp Gly Lys Ala Gin Leu Trp Ser Gly Phe Leu Gly Arg Pro Arg 

1915 1920 1925 1930 

GGT TGC CTG GGC TCT CTT TAT CTT TCT CCT GCC CTC TCT GTG GCT CTC 6036 

Gly Cys Leu Gly Ser Leu Tyr Leu Ser Pro Ala Leu Ser Val Ala Leu 

1935 1940 1945 

AAC CCA GAC GGT GAC CAG GTG GCT GTT CGC TAC CCA GGA GAT GGC ATT 6084 
Asn Pro Asp Gly Asp Gin Val Ala Val Gly Tyr Arg Gly Asp Gly lie 

1950 1955 i960 

AAA ATC TAC AGA ATT TCT TCA GCT CCC CAC GAG GCT CAA TGC CAA GAG 6132 
Lys lie Tyr Arg He Ser Ser Gly Pro Gin Glu Ala Gin Cys Gin Glu 

1965 1970 ig 75 

CTA AAT GTG GCC GTG TCT CCA CTG GTC TGG CTG ACT CCC AGC GTC TTG 6180 
Leu Asn Val Ala Val Ser Ala Leu Val Trp Leu Ser Pro Ser Val Leu 
1980 1985 1990 

GTG ACT GGT GCA GAA GAT GGC TCC CTG CAT GGC TGC ATC CTC AGC AGA 6228 
Val Ser Gly Ala Glu Asp Gly Ser Leu His Gly Trp Met Leu Arg Arg 
1995 2000 2005 2010 

AAC TCC CTT CAG TCC CTG TGG CTG TCA TCC GTC TCC CAG AAG CCT GTG 6276 
Asn Ser Leu Gin Ser Leu Trp Leu Ser Ser Val Cys Gin Lys Pro Val 
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2015 2020 2025 

CTG CCC CTC GCT GCC TCC CAG GAG TTC TTG GCT TCT GCC TCA GAG GAC 6324 
Leu Gly Leu Ala Ala Ser Gin Glu Phe Leu Ala Ser Ala Ser Glu Asp 

2030 2035 2040 

TTC ACG GTG CGA CTG TGG CCA AGA CAG CTG CTG ACA CAG CCA CAT CCA 6372 
Phe Thr Val Arg Leu Trp Pro Arg Gin Leu Leu Thr Gin Pro His Ala 

2045 2050 2055 

GTA GAA GAG TTG CCC TGT GCG GCT GAA CTC CGG GGA CAC GAG GGG CCG 6420 
Val Glu Glu Leu Pro Cys Ala Ala Glu Leu Arg Gly His Glu Gly Pro 

2060 2065 2070 

GTG TGC TGC TGT AGC TTC AGC CCG GAT GGA CGC ATC TTG GCC ACA GCG 6468 
Val Cys Cys Cys Ser Phe Ser Pro Asp Gly Arg He Leu Ala Thr Ala 
2075 2080 2085 2090 

GCC AGG GAT CGG AAT CTC CTC TGC TGG GAC GTC AAG GTA GCC CAA GCC 6516 
Gly Arg Asp Arg Asn Leu Leu Cys Trp Asp Val Lys Val Ala Gin Ala 

2095 2100 2105 

CCT CTC CTG ATT CAC ACG TTC TCG TCC TGT CAT CGA GAC TGG ATC ACT 6564 
Pro Leu Leu lie His Thr Phe Ser Ser Cys His Arg Asp Trp lie Thr 

2110 2115 2120 

GGC TGT ACG TGG ACC AAA GAC AAC ATC CTG ATC TCC TGC TCT ACT GAT 6612 
Gly Cys Thr Trp Thr Lys Asp Asn He Leu lie Ser Cys Ser Ser Asp 

2125 2130 2135 

GGC TCT GTG GGA CTC TGG AAC CCA GAG GCA GGA CAG CAA CTT GGC CAG 6660 
Gly Ser Val Gly Leu Trp Asn Pro Glu Ala Gly Gin Gin Leu Gly Gin 

2140 2145 2150 

TTC CCA GGT CAC CAG ACT GCC GTG AGC GCT GTG GTT GCT GTG GAG GAA 6708 
Phe Pro Gly His Gin Ser Ala Val Ser Ala Val Val Ala Val Glu Glu 
2155 2160 2165 2170 
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CAC ATT CTA TCT CTG ACT CCC GAT GGG ACC TTG AAA GTG TGG GAC CGT 6756 
His He Val Ser Val Ser Arg Asp Gly Thr Leu Lys Val Trp Asp Arg 

2175 2180 2185 

CAG GGT GTG GAG CTG ACC AGC ATC CCT GCC CAT TCC GGA CCC ATT AGC 6804 
Gin Gly Val Glu Leu Thr Ser He Pro Ala His Ser Gly Pro He Ser 

2190 2195 2200 

CAG TGT GCG GCT GCT CTG GAA CCC CGT CCA GCT GGA CAG CCT GGA TCA 6852 
Gin Cys Ala Ala Ala Leu Glu Pro Arg Pro Ala Gly Gin Pro Gly Ser 

2205 2210 2215 

GAG CTT ATG GTG GTG ACT GTT GGA CTG GAT GGG GCC AC A A AG CTG TGG 6900 
Glu Leu Het Val Val Thr Val Gly Leu Asp Gly Ala Thr Lys Leu Trp 

2220 2225 2230 

CAT CCC CTG TTG GTG TGC CAA ATA CAT ACC CTG CAG GGA CAC ACT GGT 6948 
His Pro Leu Leu Val Cys Gin He His Thr Leu Gin Gly His Ser Gly 
2235 2240 2245 2250 

CCA GTC ACA GCT GCT GCT GCT TCA GAG GCC TCA GGC CTC CTG CTG ACC 6996 
Pro Val Thr Ala Ala Ala Ala Ser Glu Ala Ser Gly Leu Leu Leu Thr 

2255 2260 2265 

TCA GAC AAT ACC TCT GTA CGA CTC TGG CAG ATC CCT AAG GAA GCA GAT 7044 
Ser Asp Asn Ser Ser Val Arg Leu Trp Gin lie Pro Lys Glu Ala Asp 

2270 2275 2280 

GAT ACC TGC AAA CCT AGG ACT TCT GCG GTC ATC ACC GCT GTG GCG TGG 7092 
Asp Thr Cys Lys Pro Arg Ser Ser Ala Val lie Thr Ala Val Ala Trp 

2285 2290 2295 

GCA CCA GAT GGT TCT CTG GTG GTG TCT GGA AAT GAA GCT GGG GAA CTA 7140 
Ala Pro Asp Gly Ser Leu Val Val Ser Gly Asn Glu Ala Gly Glu Leu 

2300 2305 2310 

ACG CTG TGG CAG AAA GCG CAG GCT GTG GCT ACG GCA CGG GCT CCA GGC 7188 
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Thr Leu Trp Cln Lys Ala Gin Ala Val Ala Thr Ala Arg Ala Pro Gly 
2315 2320 2325 2330 

CGC GTC ACT GAC CTG ATC TGG TGC TCC GCA AAT GCA TTC TTT GTT CTC 7236 
Arg Val Ser Asp Leu He Trp Cys Ser Ala Asn Ala Phe Phe Val Leu 

2335 2340 2345 

ACT GCT AAT GAA AAT GTC ACT GAG TGG CAA GTG GAA CTG AGG AAA GGT 7284 
Ser Ala Asn Glu Asn Val Ser Glu Trp Gin Val Clu Leu Arg Lys Gly 

2350 2355 2360 

TCA ACA TGC ACC AAT TTC AGA CTT TAT CTG AAG AGA GTT CTG CAG GAG 7332 
Ser Thr Cys Thr Asn Phe Arg Leu Tyr Leu Lys Arg Val Leu Gin Glu 

2365 2370 2375 

GAC TTG GGA GTC TTG ACA GGT ATG GCC CTG GCG CCT GAC GGC CAG TCT 7380 
Asp Leu Gly Val Leu Thr Gly Met Ala Leu Ala Pro Asp Gly Gin Ser 

2380 2385 2390 

CTC ATT TTG ATG AAA GAG GAT GTA GAA TTG CTA CAG ATG AAG CCC GGG 7428 
Leu He Leu Met Lys Glu Asp Val Glu Leu Leu Gin Het Lys Pro Gly 
2395 2400 2405 2410 

TCT ACT CCA TCT TCG ATC TGC AGG AGG TAT GCA GTG CAT TCT TCT ATA 7476 
Ser Thr Pro Ser Ser He Cys Arg Arg Tyr Ala Val His Ser Ser He 

2415 2420 2425 

CTG TGC ACC AGC AAA GAC TAT GGC CTG TTT TAC CTG CAG CAG GGA AAC 7524 
Leu Cys Thr Ser Lys Asp Tyr Gly Leu Phe Tyr Leu Gin Gin Gly Asn 

2430 2435 2440 

TCT GGA TCT CTT TCT ATC TTG GAG CAG GAG GAG TCA GGG AAG TTT GAA 7572 
Ser Gly Ser Leu Ser He Leu Glu Gin Glu Glu Ser Gly Lys Phe Glu 

2445 2450 2455 

AAG ACC CTG GAC TTC AAT CTG AAC TTA AAT AAT CCT AAT GGG TCC CCA 7620 
Lys Thr Leu Asp Phe Asn Leu Asn Leu Asn Asn Pro Asn Gly Ser Pro 
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2460 2465 2470 

GTA TCA ATC ACT CAG GCT GAA CCT GAG TCT GGG TCC TCG CTT TTG TGT 7668 
Val Ser He Thr Gin Ala Glu Pro Glu Ser Gly Ser Ser Leu Leu Cys 
2475 2480 2485 2490 

GCT ACC TCT GAT GGG ATG CTG TGG A AC TTA TCT GAG TGT ACC CCA GAA 7716 
Ala Thr Ser Asp Gly Met Leu Trp Asn Leu Ser Glu Cys Thr Pro Glu 

2495 2500 2505 

GGA GAG TGG GTC GTA GAT AAC ATC TGG CAG AAA AAA TCA AGA AAC CCT 7764 
Gly Glu Trp Val Val Asp Asn He Trp Gin Lys Lys Ser Arg Asn Pro 

2510 2515 2520 

AAA ACT CGA ACT CCG GGG ACA GAT TCG TCC CCA GGC TTA TTC TGC ATG 7812 
Lys Ser Arg Thr Pro Gly Thr Asp Ser Ser Pro Gly Leu Phe Cys Met 

2525 2530 2535 

GAT AGC TGG GTA GAA CCC ACA CAT TTA AAG CCA ' CCG CAG TGT AAA AAC 7860 
Asp Ser Trp Val Glu Pro Thr His Leu Lys Ala Arg Gin Cys Lys Lys 

2540 2545 2550 

ATT CAC TTG GGC TCT GTC ACG GCC CTC CAT GTC CTG CCC GGA TTG CTG 7908 
He His Leu Gly Ser Val Thr Ala Leu His Val Leu Pro Gly Leu Leu 
2555 25 60 2565 2570 

CTG ACT GCT TCA GAG GAC AGA GAT GTT AAG CTG TGG GAG AGA CCC ACT 7956 
Val Thr Ala Ser Glu Asp Arg Asp Val Lys Leu Trp Glu Arg Pro Ser 

2575 2580 2585 

ATG CAG CTG CTC GGC TTG TTC CGA TGT GAA GGG CCG CTG AGC TGT CTG 8004 
Het Gin Leu Leu Gly Leu Phe Arg Cys Glu Gly Pro Val Ser Cys Leu 

2590 2595 2600 

GAA CCT TGG ATG GAG CCC AGC TCT CCC CTG CAG CTT GCT GTG GGA GAT 8052 
Glu Pro Trp Het Glu Pro Ser Ser Pro Leu Gin- Leu Ala Val Gly Asp 
2605 2610 2615 
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GCA CAA GGA AAC TTG TAT TTT CTA TCT TGG GAA TGAAGATGAA 8095 
Ala Gin Gly Asn Leu Tyr Phe Leu Ser Trp Glu *** 

2620 2625 2629 

GAATCAGGAC AAAGATGGTG TCACCGGATG ATGGTCACCT GAAGACACCA GTGTCTATAT 8155 
TCTTAATAAG GTTATAAAAT AAAGTGTTGG AAGATCTAAA AAAAAAAAAA AAAAAAAAAA 8215 

rnvm^ : 2 

mmv&z : mm=4 8 7. t i ;®= 1 6 2 

W.Jtl<Dmm : c DNA 
: t h 

AAG TTC GCG CAG TTT GAC GAG TAC CAG CTG GCT AAG TAC AAC CCT CGG 48 
Lys Phe Ala Gin Phe Asp Glu Tyr Gin Leu Ala Lys Tyr Asn Pro Arg 

1 5 10 15 

AAG CAC CGG GCC AAG AGA CAC CCC CGC CGC CCA CCC CGC TCT CCA GGG 96 
Lys His Arg Ala Lys Arg His Pro Arg Arg Pro Pro Arg Ser Pro Gly 

20 25 30 

ATG GAG CCT CCA TTT TCT CAC AGA TGT TTT CCA AGG TAC ATA GGG TTT 144 
Met Glu Pro Pro Phe Ser His Arg Cys Phe Pro Arg Tyr He Gly Phe 

35 40 45 

CTC AGA GAA GAG CAG AGA AAG TTT GAG AAG GCC GGT GAT ACA GTG TCA 192 
Leu Arg Glu Glu Gin Arg Lys Phe Glu Lys Ala Gly Asp Thr Val Ser 

50 55 60 

GAG AAA AAG AAT CCT CCA AGG TTC ACC CTG AAG AAG CTG GTT CAG CGA 240 
Glu Lys Lys Asn Pro Pro Arg Phe Thr Leu Lys Lys Leu Val Gin Arg 
65 70 75 80 
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CTC CAC ATC CAC AAG CCT GCC CAG CAC GTT CAA GCC CTG CTG GGT TAC 288 

Leu His He His Lys Pro Ala Gin His Val Gin Ala Leu Leu Gly Tyr 

85 go 95 

AGA TAC CCC TCC AAC CTA CAG CTC TTT TCT CGA ACT CCC CTT CCT GGG 336 

Arg Tyr Pro Ser Asn Leu Gin Leu Phe Ser Arg Ser Arg Leu Pro Gly 

100 105 no 

CCT TGG GAT TCT AGC AGA GCT GGG AAC AGG ATG AAG CTG TCT AGG CCA 384 

Pro Trp Asp Ser Ser Arg Ala Gly Lys Arg Met Lys Leu Ser Arg Pro 

115 120 125 

GAG ACC TGG GAG CGG GAG CTG AGC CTA CGG GGG AAC AAA GCG TCG GTC 432 

Glu Thr Trp Glu Arg Glu Leu Ser Leu Arg Gly Asn Lys Ala Ser Val 

130 135 140 

TGG GAG GAA CTC ATT GAA AAT GGG AAG CTT CCC TTC ATG GCC ATG CTC 480 

Trp Glu Glu Leu lie Glu Asn Gly Lys Leu Pro Phe Met Ala Met Leu 

145 150 155 160 

AGC ATC T 

487 

Ser He 
162 
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CGGATGTAGA TGGTCAGGAG ATTAAGTGAT GCTGCCTGCA CCTTATCTTT GCAGGTATTG 
GGTCATTCTC AGAAATTATT TTATCAGGCA GATTCAATAA GCAAGATAGT TTAGGGCGTG 
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AAGCTATCGG ATCAGTCACC AAGATGCCAA TCTCTTCTAC CCTCTCTCC TAG GCA TCT 178 

*** Ala Ser 
1 

TTG TAT GCT AGG CAG CAG CTT AAC CTC CGG GAC ATA GCC AAT ATA GTG 226 
Leu Tyr Ala Arg Gin Gin Leu Asn Leu Arg Asp I le Ala Asn 1 le Val 

5 10 15 

TTG GCC GTG GCT GCC CTC TTG CCA GCC TGC CGC CCC CAT GTA CGA CGG 274 
Leu Ala Val Ala Ala Leu Leu Pro Ala Cys Arg Pro His Val Arg Arg 

20 25 30 

TAT TAC TCT GCC ATT GTT CAC CTG CCT TCA GAC TGG AAC CAG GTA GCC 322 
Tyr Tyr Ser Ala lie Yal His Leu Pro Ser Asp Trp Asn Gin Val Ala 
35 40 45 50 

GAG TTC TAC CAG GTA TGG TAC TTA G 347 
Glu Phe Tyr Gin Val Trp Tyr Leu 
55 
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AGC TTG GGG GGA GAA GAA GAA GAA GTG GTG GGG GCA CCG GTC CTA AAA 48 
Ser Leu Gly Gly Glu Glu Glu Glu VA1 Val Gly Ala Pro Val Leu Lys 

15 10 15 

CTC ACA TCT GGA GAC TCT GAC TCT CAC CCT GAA ACC ACT GAC CAG ATC 96 
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Leu Thr Ser Cly Asp Ser Asp Ser His Pro Glu Thr Thr Asp Gin He 

20 25 30 

CTG CAG GAG AAG AAG ATG GCT CTC TTG ACC TTG CTG TGC TCA GCT ATG 144 
Leu Gin Glu Lys Lys Met Ala Leu Leu Thr Leu Leu Cys Ser Ala Met 

35 40 45 

GCC TCA AGT GTG AAT GTG AAA GAT GCC TCC GAT CCT ACC CGG GCA TCT 192 
Ala Ser Ser Val Asn Val He Tyr Ala Ser Asp Pro Thr Arg Ala Ser 

50 55 60 

ATC CAT GAA GTC TGC AGT GCG CTG GCC CCC TTG GAA CCT GAG TTC ATC 240 
He His Glu Val Cys Ser Ala Leu Ala Pro Leu Glu Pro Glu Phe He 
65 70 75 80 

CTT AAG GCA TCT TTG TAT GCT AGG CAG CAG CTT AAC CTC CGG GAC ATA 288 
Leu Lys Ala Ser Leu Tyr Ala Arg Gin Gin Leu Asn Leu Arg Asp He 

85 90 95 

GCC AAT ATA GTG TTG GAA GTG GCT GCC CTC TTG CCA GCC TGC CGC CCC 336 
Ala Asn He Val Lys Ala Val Ala Ala Leu Leu Pro Ala Cys Arg Pro 

100 105 110 

CAT GTA CGA CGG TAT TAC TCT GCC ATT GTT CAC CTG CCT TCA GAC TGG 384 
His Val Arg Arg Tyr Tyr Ser Ala He Val His Leu Pro Ser Asp Trp 

115 120 125 

ATC CAG GTA GCC GAG TTC TAC CAG 408 
He Glu Val Ala Glu Phe Tyr Gin 
130 135 
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CCCACTCTAA GCCATATACA AGCTGAGTTT CAGCC ATG GAA AAA CTC CAT 170 

Met Glu Lys Leu His 
1 5 

GCG CAT GTG TCT GCC CAT CCA GAC ATC CTC TCC TTG GAG AAC CGG TGC 218 
Gly His Val Ser Ala His Pro Asp lie Leu Ser Leu Glu Asn Arg Cys 

10 15 • 20 

CTG GCT ATG CTC CCT GAC TTA CAG CCC TTG GAG AAA CTA CAT CAG CAT 266 
Leu Ala Bet Leu Pro Asp Leu Gin Pro Leu Glu Lys Leu His Gin His 

25 30 35 

GTA TCT ACC CAC TCA GAT ATC CTC TCC TTG AAG AAC CAG TGC CTA GCC 314 
Val Ser Thr His Ser Asp He Leu Ser Leu Lys Asn Gin Cys Leu Ala 

40 45 50 

ACG CTT CCT GAC CTG AAG ACC ATG GAA AAA CCA CAT GGA TAT GTG TCT 362 
Thr Leu Pro Asp Leu Lys Thr Met Glu Lys Pro His Gly Tyr Val Ser 

55 60 65 

GCC CAC CCA GAC ATC CTC TCC TTG GAG AAC CAG TGC CTG GCC ACA CTT 410 
Ala His Pro Asp lie Leu Ser Leu Glu Asn Gin Cys Leu Ala Thr Leu 
™ 75 80 85 

TCT GAC CTG AAG ACC ATG GAG AAA CCA CAT GGA CAT GTT TCT GCC CAC 458 
Ser Asp Leu Lys Thr Met Glu Lys Pro His Gly His Val Ser Ala His 

90 95 100 

CCA GAC ATC CTC TCC TTG GAG AAC CGA TGC CTG GCC ACC CTC TCT ACT 506 
Pro Asp lie Leu Ser Leu Glu Asn Arg Cys Leu Ala Thr Leu Ser Ser 

105 HO us 

CTA AAG AGC ACT GTG TCT GCC AGC CCC TTG TTC CAG ACT CTA CAG ATA 554 
Leu Lys Ser Thr Val Ser Ala Ser Pro Leu Phe Gin Ser Leu Gin He 

120 125 130 

TCT CAC ATG ATG CAA GCT GAT TTG TAC CGT GTG AAC AAC AGC AAT TGC 602 
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Ser His Met Het Gin Ala Asp Leu Tyr Arg Val Asn Asn Ser Asn Cys 

135 140 145 

CTG CTC TCT GAG CCT CCA ACT TGG AGG GCT CAG CAT TTC TCT AAG GGA 650 
Leu Leu Ser Glu Pro Pro Ser Trp Arg Ala Gin His Phe Ser Lys Gly 
150 ' 155 160 165 

CTA GAC CTT TCA ACC TGC CCT ATA GCC CTG AAA TCC ATC TCT GCC ACA 698 
Leu Asp Leu Ser Thr Cys Pro He Ala Leu Lys Ser lie Ser Ala Thr 

170 175 180 

GAG ACA GCT CAG GAA GCA ACT TTG GGT CGT TGG TTT GAT TCA GAA GAG 746 
Glu Thr Ala Gin Glu Ala Thr Leu Gly Arg Trp Phe Asp Ser Glu Glu 

185 190 195 

AAG AAA GGG GCA GAG ACC CAA ATG CCT TCT TAT ACT CTG AGC TTG GGA 794 
Lys Lys Gly Ala Glu Thr Gin Met Pro Ser Tyr Ser Leu Ser Leu Gly 

200 205 210 

GAG GAG GAG GAG GTG GAG GAT CTG GCC GTG AAG CTC ACC TCT GGA GAC 842 
Glu Glu Glu Glu Val Glu Asp Leu Ala Val Lys Leu Thr Ser Gly Asp 

215 220 225 

TCT GAA TCT CAT CCA GAG CCT ACT GAC CAT GTC CTT CAG GAA AAG AAG 890 
Ser Glu Ser His Pro Glu Pro Thr Asp His Val Leu Gin Glu Lys Lys 
230 235 240 245 

ATG GCT CTA CTG AGC TTG CTG TGC TCT ACT CTG GTC TCA GAA GTA AAC 938 
Het Ala Leu Leu Ser Leu Leu Cys Ser Thr Leu Val Ser Glu Val Asn 

250 255 260 

ATG AAC AAT ACA TCT GAC CCC ACC CTG GCT GCC ATT TTT GAA ATC TGT 986 
Het Asn Asn Thr Ser Asp Pro Thr Leu Ala Ala lie Phe Glu He Cys 

265 270 275 

CGT GAA CTT GCC CTC CTG GAG CCT GAG TTT ATC CTC AAG GCA TCT TTG 1034 
Arg Glu Leu Ala Leu Leu Glu Pro Glu Phe lie Leu Lys Ala Ser Leu 
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280 285 290 

TAT GCC AGG CAG CAG CTG A AC GTC CGG AAT GTG GCC AAT AAA ATC TTG 1082 

Tyr Ala Arg Gin Gin Leu Asn Val Arg Asn Val Ala Asn Lys He Leu 

295 300 305 

GCC ATT GCT GCT TTC TTG CCG GCG TGT CGC CCC CAC CTG CGA CGA TAT 1130 

Ala lie Ala Ala Phe Leu Pro Ala Cys Arg Pro His Leu Arg Arg Tyr 
310 315 320 325 

TTC TGT GCC ATT GTC CAG CTG CCT TCT GAC TGG ATC CAG GTG GCT GAG 1178 

Phe Cys Ala lie Val Gin Leu Pro Ser Asp Trp He Gin Val Ala Glu 

330 335 340 

CTT TAC CAG AGC CTG GCT GAG GGA GAT AAG AAT AAG CTG GTG CCC CTG 1226 

Leu Tyr Gin Ser Leu Ala Glu Gly Asp Lys Asn Lys Leu Val Pro Leu 

345 350 355 

CCC GCC TGT CTC CGT ACT GCC ATG ACG GAC AAA TTT GCC CAG TTT GAC 1274 

Pro Ala Cys Leu Arg Thr Ala Met Thr Asp Lys Phe Ala Gin Phe Asp 

360 365 370 

GAG TAC CAG CTG GCT AAG TAC AAC CCT CGG AAG CAC CGG GCC AAG AGA 1322 . 

Glu Tyr Gin Leu Ala Lys Tyr Asn Pro Arg Lys His Arg Ala Lys Arg 

375 380 385 

CAC CCC CGC CGG CCA CCC CGC TCT CCA GGG ATG GAG CCT CCA TTT TCT 1370 

His Pro Arg Arg Pro Pro Arg Ser Pro Gly Bet Glu Pro Pro Phe Ser 
390 395 400 405 

CAC AGA TGT TTT CCA AGG TAC ATA GGG TTT CTC AGA GAA GAG CAG AGA 1418 

His Arg Cys Phe Pro Arg Tyr He Gly Phe Leu Arg Glu Glu Gin Arg 

410 415 420 

AAG TTT GAG AAG GCC GGT GAT ACA GTG TCA GAG AAA AAG AAT CCT CCA 1466 

Lys Phe Glu Lys Ala Gly Asp Thr Val Ser Glu Lys Lys Asn Pro Pro 
425 430 435 
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AGG TTC ACC CTG AAG AAG CTG GTT CAG CGA CTG CAC ATC CAC AAG CCT 1514 
Arg Phe Thr Leu Lys Lys Leu Val Gin Arg Leu His lie His Lys Pro 

440 445 450 

GCC CAG CAC GTT CAA GCC CTG CTG GGT TAC AGA TAC CCC TCC AAC CTA 1562 
Ala Gin His Val Gin Ala Leu Leu Gly Tyr Arg Tyr Pro Ser Asn Leu 

455 460 465 

CAG CTC TTT TCT CGA ACT CGC CTT CCT GGG CCT TGG GAT TCT AGC AGA 1610 
Gin Leu Phe Ser Arg Ser Arg Leu Pro Gly Pro Trp Asp Ser Ser Arg 
470 475 480 485 

CCT GGG AAG AGG ATG AAG CTG TCT AGG CCA GAG ACC TGG GAG CGG GAG 1658 
Ala Gly Lys Arg Het Lys Leu Ser Arg Pro Glu Thr Trp Glu Arg Glu 

490 495 500 

CTG AGC CTA CGG GGG AAC AAA GCG TCG GTC TGG GAG GAA CTC ATT GAA 1706 
Leu Ser Leu Arg Gly Asn Lys Ala Ser Val Trp Glu Glu Leu He Clu 

505 510 515 

AAT GGG AAG CTT CCC TTC ATG GCC ATG CTT CGG AAC CTG TGC AAC CTG 1754 
Asn Gly Lys Leu Pro Phe Het Ala Met Leu Arg Asn Leu Cys Asn Leu 

520 525 530 

CTG CGG GTT GGA ATC ACT TCC CGC CAC CAT GAG CTC ATT CTC CAG AGA 1802 
Leu Arg Yal Gly lie Ser Ser Arg His His Glu Leu lie Leu Gin Arg 

535 540 545 

CTC CAG CAT GCG AAG TCG GTG ATC CAC ACT CGG CAG TTT CCA TTC AGA 1850 
Leu Gin His Ala Lys Ser Val He His Ser Arg Gin Phe Pro Phe Arg 
550 555 560 565 

TTT CTT AAC GCC CAT GAT GCC ATT GAT GCC CTC GAG GCT CAA CTC AGA 1898 
Phe Leu Asn Ala His Asp Ala lie Asp Ala Leu Glu Ala Gin Leu Arg 

570 575 580 

AAT CAA GCA TTG CCC TTT CCT TCG AAT ATA ACA CTC ATG AGG CGG ATA 1946 
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Asn Gin Ala Leu Pro Phe Pro Ser Asn He Thr Leu Met Arg Arg He 

585 590 595 

CTA ACT ACA AAT GAA AAG AAC CGT CCC AGG CGG AGG TTT CTT TGC CAC 1994 
Leu Thr Arg Asn Glu Lys Asn Arg Pro Arg Arg Arg Phe Leu Cys His 

600 605 610 

CTA AGC CGT CAG CAG CTT CGG ATG GCA ATG AGG ATA CCT GTG TTG TAT 2042 
Leu Ser Arg Gin Gin Leu Arg Met Ala Met Arg He Pro Val Leu Tyr 

615 620 625 

GAG CAG CTC AAG AGG GAG AAG CTG AGA GTA CAC AAG CCC AGA CAG TGG 2090 
Glu Gin Leu Lys Arg Glu Lys Leu Arg Val His Lys Ala Arg Gin Trp 
630 635 640 645 

AAA TAT GAT GGT GAG ATG CTG AAC AGG TAC CGA CAG GCC CTA GAG ACA 2138 
Lys Tyr Asp Gly Glu Het Leu Asn Arg Tyr Arg Gin Ala Leu Glu Thr 

650 655 660 

GCT GTG AAC CTC TCT GTG AAG CAC AGC CTG CCC CTG CTG CCA GGC CGC 2186 
Ala Val Asn Leu Ser Val Lys His Ser Leu Pro Leu Leu Pro Gly Arg 

665 670 575 

ACT GTC TTG GTC TAT CTG ACA GAT GCT AAT GCA GAC AGG CTC TGT CCA 2234 
Thr Val Leu Val Tyr Leu Thr Asp Ala Asn Ala Asp Arg Leu Cys Pro 

680 685 690 

AAG AGC AAC CCA CAA GGG CCC CCG CTG AAC TAT GCA CTG CTG TTG ATT 2282 
Lys Ser Asn Pro Gin Gly Pro Pro Leu Asn Tyr Ala Leu Leu Leu lie 

695 700 705 

GGG ATG ATG ATC ACG AGG GCG GAG CAG GTG GAC GTC GTG CTG TGT GGA 2330 
Gly Het Met lie Thr Arg Ala Glu Gin Val Asp Val Val Leu Cys Gly 
710 715 720 725 

GGT GAC ACT CTG AAG ACT GCA GTG CTT AAG GCA GAA GAA GGC ATC CTG 2378 
Gly Asp Thr Leu Lys Thr Ala Val Leu Lys Ala Glu Glu Gly lie Leu 
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730 735 740 

AAG ACT GCC ATC AAG CTC CAG GCT CAA GTC CAG GAG TTT GAT GAA AAT 2426 
Lys Thr Ala He Lys Leu Gin Ala Gin Val Gin Glu Phe Asp Glu Asn 

745 750 755 

GAT GGA TGG TCC CTG AAT ACT TTT GGG AAA TAC CTG CTG TCT CTG GCT 2474 
Asp Gly Trp Ser Leu Asn Thr Phe Gly Lys Tyr Leu Leu Ser Leu Ala 

760 765 770 

GGC CAA AGG GTT CCT GTG GAC AGG GTC ATC CTC CTT GGC CAA AGC ATG 2522 
Gly Gin Arg Val Pro Val Asp Arg Val lie Leu Leu Gly Gin Ser Met 

775 780 785 

GAT GAT GGA ATG ATA AAT GTG GCC AAA CAG CTT TAC TGG CAG CGT GTG 2570 
Asp Asp Gly Het He Asn Val Ala Lys Gin Leu Tyr Trp Gin Arg Val 
790 795 800 805 

AAT TCC AAG TGC CTC TTT GTT GGT ATC CTC CTA AGA AGG GTA CAA TAC 2618 
Asn Ser Lys Cys Leu Phe Val Gly He Leu Leu Arg Arg Val Gin Tyr 

810 815 820 

CTG TCA ACA GAT TTG AAT CCC AAT GAT GTG ACA CTC TCA GGC TGT ACT 2666 
Leu Ser Thr Asp Leu Asn Pro Asn Asp Val Thr Leu Ser Gly Cys Thr 

825 830 835 

GAT GCG ATA CTG AAG TTC ATT GCA GAG CAT GGG GCC TCC CAT CTT CTG 2714 
Asp Ala He Leu Lys Phe He Ala Glu His Gly Ala Ser His Leu Leu 

840 845 850 

GAA CAT GTG GGC CAA ATG GAC AAA ATA TTC AAG ATT CCA CCA CCC CCA 2762 
Glu His Val Gly Gin Met Asp Lys He Phe Lys He Pro Pro Pro Pro 

855 860 865 

GGA AAG ACA GGG GTC CAG TCT CTC CGG CCA CTG GAA GAG GAC ACT CCA 2810 
Gly Lys Thr Gly Val Gin Ser Leu Arg Pro Leu Glu Glu Asp Thr Pro 
870 875 880 885 
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AGC CCC TTG CCT CCT GTT TCC CAG CAA GGA TGG GGC AGC ATC CGG CTT 2858 
Ser Pro Leu Ala Pro Val Ser Gin Gin Gly Trp Gly Ser He Arg Leu 

890 895 900 

TTC ATT TCA TCC ACT TTC CGA GAC ATG CAC CGG GGA GCG GAC CTG CTG 2906 
Phe He Ser Ser Thr Phe Arg Asp Met His Arg Cly Ala Asp Leu Leu 

905 910 915 

CTG AGG TCT GTG CTG CCA GCA CTG CAG GCC CGA GCG GCC CCT CAC CGT 2954 
Leu Arg Ser Val Leu Pro Ala Leu Gin Ala Arg Ala Ala Pro His Arg 

920 925 930 

ATC AGC CTT CAC CGA ATC GAC CTC CGC TGG GGC GTC ACT GAG GAG GAG 3002 
He Ser Leu His Arg He Asp Leu Arg Trp Gly Val Thr Glu Clu Glu 

935 940 945 

ACC CGT AGG AAC AGA CAA CTG GAA GTG TGC CTT GGG GAG GTG GAG AAC 3050 
Thr Arg Arg Asn Arg Gin Leu Glu Val Cys Leu Gly Glu Val Glu Asn 
950 955 960 9 6 5 

GCA CAG CTG TTT GTG GGG ATT CTG GGC TCC CGT TAT GGA AAC ATT CCC 3098 
Ala Gin Leu Phe Val Gly He Leu Gly Ser Arg Tyr Gly Asn lie Pro 

970 975 9 80 

CCC AGC TAG AAC CTT CCT GAC CAT CCA CAC TTC CAC TGG GCC CAG CAG 3146 
Pro Ser Tyr Asn Leu Pro Asp His Pro His Phe His Trp Ala Gin Gin 

985 990 gg5 

TAC CCT TCA GGG CGC TCT GTG ACA GAG ATG GAG GTG ATG CAG TTC CTG 3194 
Tyr Pro Ser Gly Arg Ser Val Thr Glu Bet Clu Val Het Cln Phe Leu 

1000 1005 1010 

AAC CGG AAC CAA CGT CTG CAG CCC TCT GCC CAA GCT CTC ATC TAC TTC 3242 
Asn Arg Asn Gin Arg Leu Gin Pro Ser Ala Gin Ala Leu He Tyr Phe 

1015 1020 1025 

CGG GAT TCC AGC TTC CTC AGC TCT GTG CCA GAT GCC TGG AAA TCT GAC 3290 
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Arg Asp Ser Ser Phe Leu Ser Ser Val Pro Asp Ala Trp Lys Ser Asp 
1030 1035 J040 ,045 

TTT GTT TCT GAG TCT GAA GAG GCC GCA TGT CGG ATC TCA GAA CTG AAG 3338 
Phe Val Ser Glu Ser Glu Glu Ala Ala Cys Arg lie Ser Glu Leu Lys 

1050 1055 1060 

AGC TAC CTA AGC AGA CAG AAA GGG ATA ACC TGC CGC AGA TAC CCC TGT 3386 
Ser Tyr Leu Ser Arg Gin Lys Gly He Thr Cys Arg Arg Tyr Pro Cys 

1065 1070 1075 

GAG TGG GGG GGT GTG GCA GCT GGC CGG CCC TAT GTT GGC GGG CTG GAG 3434 
Glu Trp Gly Gly Val Ala Ala Gly Arg Pro Tyr Val Gly Gly Leu Glu 

1080 1085 1090 

GAG TTT GGG CAG TTG GTT CTG CAG GAT GTA TGG AAT ATG ATC CAG AAG 3482 
Glu Phe Gly Gin Leu Val Leu Gin Asp Val Trp Asn Met He Gin Lys 
1095 hoo U05 

CTC TAC CTG CAG CCT GGG GCC CTG CTG GAG CAG CCA GTG TCC ATC CCA 3530 
Leu Tyr Leu Gin Pro Gly Ala Leu Leu Glu Gin Pro Val Ser He Pro 
1110 H15 1120 i 125 

GAC GAT GAC TTG GTC CAG GCC ACC TTC CAG CAG CTG CAG AAG CCA CCG 3578 
Asp Asp Asp Leu Val Gin Ala Thr Phe Gin Gin Leu Gin Lys Pro Pro 

U30 1135 U4 0 

ACT CCT GCC CGG CCA CGC CTT CTT CAG GAC ACA GTG CAA CGG CTG ATG 3626 
Ser Pro Ala Arg Pro Arg Leu Leu Gin Asp Thr Val Gin Arg Leu Met 

H45 U50 1155 

CTG CCC CAC GGA ACG CTC AGC CTG GTG ACG GGG CAG TCA GGA CAG GGC 3674 • 
Leu Pro His Gly Arg Leu Ser Leu Val Thr Gly Gin Ser Gly Gin Gly 

1160 1165 H70 

AAG ACA GCC TTC CTG GCA TCT CTT GTG TCA GCC CTG CAG CCT CCT GAT 3722 
Lys Thr Ala Phe Leu Ala Ser Leu Val Ser Ala Leu Gin Ala Pro Asp 
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1175 1180 1185 

GGG GCC AAG GTG GCA CCA TTA GTC TTC TTC CAC TTT TCT GGG GCT CGT 3770 
Gly Ala Lys Val Ala Pro Leu Val Phe Phe His Phe Ser Gly Ala Arg 
1190 1195 1200 1205 

CCT GAC CAG GGT CTT GCC CTC ACT CTG CTC AGA CGC CTC TGT ACC TAT 3818 
Pro Asp Gin Gly Leu Ala Leu Thr Leu Leu Arg Arg Leu Cys Thr Tyr 

1210 1215 1220 

CTG CGT GGC CAA CTA AAA GAG TCA GGT GCC CTC CCC AGC ACC TAC CGA 3866 
Leu Arg Gly Gin Leu Lys Glu Ser Gly Ala Leu Pro Ser Thr Tyr Arg 

1225 1230 1235 

AGC CTG GTG TGG GAG CTG CAG CAG AGG CTG CTG CCC AAG TCT GCT GAG 3914 
Ser Leu Val Trp Glu Leu Gin Gin Arg Leu Leu Pro Lys Ser Ala Glu 

1240 1245 1250 

TCC CTG CAT CCT GGC CAG ACC CAG GTC CTG ATC ATC GAT GGG GCT GAT 3962 
Ser Leu His Pro Gly Gin Thr Gin Val Leu He He Asp Gly Ala Asp 

1255 1260 1265 

AGG TTA GTG GAC CAG AAT GGG CAG CTG ATT TCA GAC TGG ATC CCA AAG 4010 
Arg Leu Val Asp Gin Asn Gly Gin Leu lie Ser Asp Trp He Pro Lys 
1270 1275 1280 1285 

AAG CTT CCC CGG TGT GTA CAC CTG GTG CTG ACT GTG TCT ACT GAT GCA 4058 
Lys Leu Pro Arg Cys Val His Leu Val Leu Ser Val Ser Ser Asp Ala 

1290 1295 1300 

GGC CTA GGG GAG ACC CTT GAG CAG AGC CAG GGT GCC CAC GTG CTG GCC 4106 
Gly Leu Gly Glu Thr Leu Glu Gin Ser Gin Gly Ala His Val Leu Ala 

1305 1310 1315 

TTG GGG CCT CTG GAG GCC TCT GCT CGG GCC CGG CTG GTG AGA GAG GAG 4154 
Leu Gly Pro Leu Glu Ala Ser Ala Arg Ala Arg Leu Val Arg Glu Glu 
1320 1325 1330 
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CTC GCC CTC TAC GGG AAC CGG CTG GAG GAG TCA CCA TTT AAC AAC CAG 4202 
Leu Ala Leu Tyr Gly Lys Arg Leu Clu Glu Ser Pro Phe Asn Asn Gin 

1335 1340 1345 

ATG CGA CTG CTG CTG CTG AAG CGG GAA TCA CGC CGG CCG CTC TAC CTG 4250 
Bet Arg Leu Leu Leu Val Lys Arg Glu Ser Gly Arg Pro Leu Tyr Leu 
1350 13 *5 1360 1365 

CGC TTG CTC ACC GAT CAC CTC AGG CTC TTC ACG CTG TAT CAG CAG GTG 4298 
Arg Leu Val Thr Asp His Leu Arg Leu Phe Thr Leu Tyr Clu Gin Val 

1370 1375 1380 

TCT GAG AGA CTC CGG ACC CTG CCT GCC ACT GTC CCC CTG CTG CAG CAC 4346 
Ser Glu Arg Leu Arg Thr Leu Pro Ala Thr Val Pro Leu Leu Gin- His 

1385 1390 1395 

ATC CTG AGC ACA CTG GAG AAG GAG CAC GGG CCT GAT GTC CTT CCC CAG 4394 
He Leu Ser Thr Leu Glu Lys Glu His Gly Pro Asp Val Leu Pro Gin 

1400 1405 1410 

GCC TTG ACT GCC CTA GAA GTC ACA CGC ACT GGT TTG ACT GTC GAC CAG 4442 
Ala Leu Thr Ala Leu Glu Val Thr Arg Ser Gly Leu Thr Val Asp Gin 

1415 1420 1425 

CTG CAC GGA CTG CTG ACT GTG TGG CGC ACA CTA CCC AAG GGC ACT AAG 4490 
Leu His Gly Val Leu Ser Val Trp Arg Thr Leu Pro Lys Gly Thr Lys 
1430 1435 U40 i4 4 5 

ACC TGG GAA GAA GCA GTG GCT GCT GGT AAC ACT GGA GAC CCC TAC CCC 4538 
Thr Trp Glu Glu Ala Val Ala Ala Gly Asn Ser Gly Asp Pro Tyr Pro 

1450 1455 M 60 

ATG GGC CCG TTT CCC TAC CTC GTC CAG ACT CTG CGC ACT TTC CTA GCC 4586 
Met Gly Pro Phe Ala Tyr Leu Val Gin Ser Leu Arg Ser Leu Leu Gly 

1465 1470 1475 

GAG GGC CCT CTG GAG CCC CCT GGT GCC CGG CTG TGC CTC CCT CAT GGG 4634 
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Glu Cly Pro Leu Clu Arg Pro Gly Ala Arg Leu Cys Leu Pro Asp Gly 

1480 1485 H90 

CCC CTG AGA ACA GCA GCT AAA CGT TGC TAT GGG AAG ACG CCA GGG CTA 4682 
Pro Leu Arg Thr Ala Ala Lys Arg Cys Tyr Gly Lys Arg Pro Gly Leu 

1495 1500 1505 

GAG GAC ACG GCA CAC ATC CTC ATT GCA GCT CAG CTC TGG AAG ACA TGT 4730 
Glu Asp Thr Ala His He Leu lie Ala Ala Gin Leu Trp Lys Thr Cys 
1510 1515 1520 1525 

GAC GCT GAT CCC TCA GGC ACC TTC CGA ACT TGC CCT CCT GAG GCT CTG 4778 
Asp Ala Asp Ala Ser Gly Thr Phe Arg Ser Cys Pro Pro Glu Ala Leu 

lg 30 1535 1540 

GGA GAC CTG CCT TAC CAC CTG CTC CAG AGC GGG AAC CGT GGA CTT CTT 4826 
Gly Asp Leu Pro Tyr His Leu Leu Gin Ser Gly Asn Arg Gly Leu Leu 

1545 1550 1555 

TCG AAG TTC CTT ACC AAC CTC CAT GTG GTG GCT GCA CAC TTG GAA TTG 4874 
Ser Lys Phe Leu Thr Asn Leu His Val Val Ala Ala His Leu Glu Leu 

1560 1565 1570 

GGT CTG GTC TCT CGG CTC TTG GAG GCC CAT GCC CTC TAT GCT TCT TCA 4922 
Gly Leu Val Ser Arg Leu Leu Glu Ala His Ala Leu Tyr Ala Ser Ser 

1575 1580 1585 

GTC CCC AAA GAG GAA CAA AAG CTC CCC GAG GCT GAC GTT GCA GTG TTT 4970 
Val Pro Lys Glu Glu Gin Lys Leu Pro Glu Ala Asp Val Ala Val Phe 
1590 1595 1600 1605 

CGC ACC TTC CTG AGG CAG CAG GCT TCA ATC CTC AGC CAG TAC CCC CGG 5018 
Arg Thr Phe Leu Arg Gin Gin Ala Ser He Leu Ser Gin Tyr Pro Arg 

l 6r0 1615 1620 

CTC CTG CCC CAG CAC GCA GCC AAC CAG CCC CTG GAC TCA CCT CTT TGC 5066 
Leu Leu Pro Gin Gin Ala Ala Asn Gin Pro Leu Asp Ser Pro Leu Cys 
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1625 1630 1635 

CAC CAA GCC TCG CTC CTC TCC CGG AGA TGG CAC CTC CAA CAC ACA CTA 5114 
His Gin Ala Ser Leu Leu Ser Arg Arg Trp His Leu Gin His Thr Leu 

1640 1645 1650 

CGA TGG CTT AAT AAA CCC CGG ACC ATG AAA AAT CAG CAA AGC TCC AGC 5162 
Arg Trp Leu Asn Lys Pro Arg Thr Het Lys Asn Gin Gin Ser Ser Ser 

1655 1660 1665 

CTG TCT CTG GCA GTT TCC TCA TCC CCT ACT GCT GTG GCC TTC TCC ACC 5210 
Leu Ser Leu Ala Val Ser Ser Ser Pro Thr Ala Val Ala Phe Ser Thr 
1670 1675 1680 1685 

AAT GGG CAA AGA GCA GCT GTG GGC ACT GCC AAT GGG ACA GTT TAC CTG 5258 
Asn Gly Gin Arg Ala Ala Val Gly Thr Ala Asn Gly Thr Val Tyr Leu 

1690 1695 1700 

TTG GAC CTG AGA ACT TGG CAG GAG GAG AAG TCT GTG GTG ACT GGC TGT 5306 
Leu Asp Leu Arg Thr Trp Gin Glu Glu Lys Ser Val Val Ser Gly Cys 

1705 1710 1715 

GAT GGA ATC TCT GCT TGT TTG TTC CTC TCC GAT GAC ACA CTC TTT CTT 5354 
Asp Gly lie Ser Ala Cys Leu Phe Leu Ser Asp Asp Thr Leu Phe Leu 

1720 1725 1730 

ACT GCC TTC GAC GGG CTC CTG CAG CTC TGG GAC CTG CAG CAT GGT TGT 5402 
Thr Ala Phe Asp Gly Leu Leu Glu Leu Trp Asp Leu Gin His Gly Cys 

1735 1740 1745 

CGG GTG CTG CAG ACT AAG GCT CAC CAG TAC CAA ATC ACT GGC TGC TGC 5450 
Arg Val Leu Gin Thr Lys Ala His Gin Tyr Gin lie Thr Gly Cys Cys 
1750 1755 1760 1765 

CTG AGC CCA GAC TGC CGG CTG CTA GCC ACC GTG TGC TTG GGA GGA TGC 5498 
Leu Ser Pro Asp Cys Arg Leu Leu Ala Thr Val Cys Leu Gly Gly Cys 
1770 1775 1780 
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CTA AAG CTC TGG CAC ACA CTC CGT GGG CAG CTG GCC TTC CAG CAC ACC 5546 
Leu Lys Leu Trp Asp Thr Val Arg Gly Gin Leu Ala Phe Gin His Thr 

1785 1790 1795 

TAC CCC AAG TCC CTG AAC TGT GTT GCC TTC CAC CCA GAG GGG CAG GTA 5594 
Tyr Pro Lys Ser Leu Asn Cys Val Ala Phe His Pro Glu Gly Gin Val 

1800 1805 1810 

ATA GCC ACA GGC AGC TGG GCT GGC AGC ATC AGC TTC TTC CAG GTG GAT 5642 
lie Ala Thr Gly Ser Trp Ala Gly Ser He Ser Phe Phe Gin Val Asp 

1815 1820 1825 

GGG CTC AAA GTC ACC AAG GGA CCT GGG GGC CCC GGA GCC TCT ATC CGT 5690 
Gly Leu Lys Yal Thr Lys Gly Pro Gly Gly Pro Gly Ala Ser lie Arg 
1830 1835 1840 1845 

ACC TTG GCC TTC AAT GTG CCT GGG GGG GTT GTG GCT GTG GGC CGG CTG 5738 
Thr Leu Ala Phe Asn Val Pro Gly Gly Val Val Ala Val Gly Arg Leu 

1850 1855 1860 

GAC ACT ATG GTG GAG CTG TGG GCC TGG CGA GAA GGG GCA CGG CTG GCT 5786 
Asp Ser Het Val Glu Leu Trp Ala Trp Arg Glu Gly Ala Arg Leu Ala 

1865 1870 1875 

GCC TTC CCT GCC CAC CAT GGC TTT GTT GCT GCT GCG CTT TTC CTG CAT 5834 
Ala Phe Pro Ala His His Gly Phe Val Ala Ala Ala Leu Phe Leu His 

1880 1885 1890 

GCG GGT TGC CAG TTA CTG ACG GCT GGA GAG GAT GGC AAG GTT CAG GTG 5882 
Ala Gly Cys Gin Leu Leu Thr Ala Gly Glu Asp Gly Lys Val Gin Val 

1895 1900 1905 

TGG TCA GGG TCT CTG GGT CGG CCC CGT GGG CAC CTG GGT TCC CTT TCT 5930 
Trp Ser Gly Ser Leu Gly Arg Pro Arg Gly His Leu Gly Ser Leu Ser 
1910 1915 1920 1925 

CTC TCT CCT GCC CTC TCT GTG GCA CTC AGC CCA GAT GGT GAT CGG GTG 5978 
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Leu Ser Pro Ala Leu Ser Val Ala Leu Ser Pro Asp Cly Asp Arg Val 

1930 1935 1940 

GCT GTT GCA TAT CGA GCG GAT GGC ATT AGG ATC TAC AAA ATC TCT TCA 6026 
Ala Val Gly Tyr Arg Ala Asp Gly He Arg He Tyr Lys lie Ser Ser 

1945 1950 1955 

GGT TCC CAG GGG GCT CAG GGT CAG GCA CTG GAT GTG GCA GTG TCG GCC 6074 
Gly Ser Gin Gly Ala Gin Gly Gin Ala Leu Asp Val Ala Val Ser Ala 

1960 1965 1970 

CTG GCC TGG ATA AGC CCC AAG GTA TTG GTG AGT GGT GCA GAA GAT GCG 6122 
Leu Ala Trp lie Ser Pro Lys Val Leu Val Ser Gly Ala Glu Asp Gly 

1975 1980 1985 

TCC TTG CAG GGC TGG GCA CTC AAG GAA TGC TCC CTT CAG TCC CTC TGG 6170 
Ser Leu Gin Gly Trp Ala Leu Lys Glu Cys Ser Leu Gin Ser Leu Trp 
1990 1995 2000 2005 

CTC CTG TCC AGA TTC CAG AAG CCT GTG CTA GGA CTG GCC ACT TCC CAG 6218 
Leu Leu Ser Arg Phe Gin Lys Pro Val Leu Gly Leu Ala Thr Ser Gin 

2010 2015 2020 

GAG CTC TTG GCT TCT GCC TCA GAG GAT TTC ACA GTG CAG CTG TGG CCA 6266 
Glu Leu Leu Ala Ser Ala Ser Glu Asp Phe Thr Val Gin Leu Trp Pro 

2025 2030 2035 

AGG CAG CTG CTG ACG CGG CCA CAC AAG GCA GAA GAC TTT CCC TGT GGC 6314 
Arg Gin Leu Leu Thr Arg Pro His Lys Ala Glu Asp Phe Pro Cys Gly 

2040 2045 2050 

ACT GAG CTG CGG GGA CAT GAG GGC CCT GTG AGC TGC TGT AGT TTC AGC 6362 
Thr Glu Leu Arg Gly His Glu Gly Pro Val Ser Cys Cys Ser Phe Ser 

2055 2060 2065 

ACT GAT GGA GGC AGC CTG GCC ACC GGG GGC CGG GAT CGG AGT CTC CTC 6410 
Thr Asp Gly Gly Ser Leu Ala Thr Gly Gly Arg Asp Arg Ser Leu Leu 
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2070 2075 2080 2085 



6554 



6602 



6650 



TCC TGC CAC CTC AGG ACA CCC AAA ACC CCT GTT TTC ATC CAC TCC TTC 
Cys Trp Asp Val Arg Thr Pro Lys Thr Pro Val Leu He His Ser Phe 

2080 2095 2100 

CCT CCC TGT CAC CCT GAC TGG GTC ACT GGC TGT GCC TGG ACC AAA GAT 
Pro Ala Cys His Arg Asp Trp Val Thr Cly Cys Ala Trp Thr Lys Asp 

2105 2110 2115 

AAC CTA CTG ATA TCC TGC TCC ACT GAT GGC TCT GTG GGG CTC TGG GAC 
Asn Leu Leu He Ser Cys Ser Ser Asp Gly Ser Val Cly Leu Trp Asp 

2,20 2125 2130 

CCA GAG TCA GGA CAG CGG CTT GGT CAC TTC CTG GGT CAT CAG ACT GCT 
Pro Glu Ser Gly Gin Arg Leu Gly Gin Phe Leu Gly His Gin Ser Ala 

2135 2140 2H5 

CTG AGC GCT GTG GCA GCT GTG GAG GAG CAC GTG GTG TCT GTG AGC CGG 
Val Ser Ala Val Ala Ala Val Glu Glu His Val Val Ser Val Ser Arg 
2150 2155 2160 2165 

GAT GGG ACC TTG AAA GTG TGG GAC CAT CAA GGC GTG GAG CTG ACC AGC 
Asp Gly Thr Leu Lys Val Trp Asp His Gin Gly Val Glu Leu Thr Ser 
2170 2I75 21g0 

He Pro Ala His Ser Gly Pro ,l e Ser His Cys Ala Ala Ala Met Glu 

2185 2190 2195 

CCC CGT GCA GCT GGA CAG CCT GGG TCA GAG CTT CTG GTG GTA ACC ATC 6794 
Pro Arg Ala Ala Gly Gin Pro Gly Ser Glu Leu Leu Val Val Thr lie 

2200 2205 2210 

GGG CTA GAT GGG GCC ACA CGG TTA TGG CAT CCA CTC TTG GTG TGC CAA 6842 
Gly Leu Asp Gly Ala Thr Arg Leu Trp His Pro Leu Leu Val Cys Gin 
2215 2220 2225 
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ACC CAC ACC CTC CTC GGA CAC AGC GGC CCA GTC CGT GCT GCT GCT GTT 6890 
Thr His Thr Leu Leu Gly His Ser Gly Pro Val Arg Ala Ala Ala Val 
2230 2235 2240 2245 

TCA GAA ACC TCA GCC CTC ATG CTG ACC GCC TCT GAG ATG TCT GTA CGG 6938 
Ser Glu Thr Ser Ala Leu Met Leu Thr Ala Ser Glu Met Ser Val Arg 

2250 2255 2260 

CTC TGG CAG GTT CCT AAG GAA GCA GAT GAC ACA TGT ATA CCA AGG ACT 6986 
Leu Trp Gin Val Pro Lys Glu Ala Asp Asp Thr Cys He Pro Arg Ser 

2265 2270 2275 

TCT GCA GCC GTC ACT GCT GTG GCT TGG GCA CCA GAT GGC TCC ATG GCA 7034 
Ser Ala Ala Val Thr Ala Val Ala Trp Ala Pro Asp Gly Ser Her Ala 

2280 2285 2290 

GTA TCT GGA AAT CAA GCT GGG GAA CTA ATC TTG TGG CAG GAA GCT AAG 7082 
Val Ser Gly Asn Gin Ala Gly Glu Leu He Leu Trp Gin Glu Ala Lys 

2295 2300 2305 

GCT GTG GCC ACA GCA CAG GCT CCA GGC CAC ATA GGT GCT CTG ATC TGG 7130 
Ala Val Ala Thr Ala Gin Ala Pro Gly His He Gly Ala Leu He Trp 
2310 2315 2320 232 5 

TCC TCG GCA CAC ACC TTT TTT GTC CTC ACT GCT GAT GAG AAA ATC ACC 7178 
Ser Ser Ala His Thr Phe Phe Val Leu Ser Ala Asp Glu Lys He Ser 

2330 2335 2340 

GAG TGG CAA GTG AAA CTG CGA GAG GGT TCG GCA CCC GGA AAT TTG ACT 7226 
Glu Trp Gin Val Lys Leu Arg Lys Gly Ser Ala Pro Gly Asn Leu Ser 

2345 2350 2355 

CTT CAC CTG AAC CGA ATT CTA CAG GAG GAC TTA GGG GTG CTG ACA ACT 7274 
Leu His Leu Asn Arg lie Leu Gin Glu Asp Leu Gly Val Leu Thr Ser 

2360 2365 2370 

CTG GAT TGG GCT CCT GAT GGT CAC TTT CTC ATC TTG GCC AAA GCA GAT 7322 
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Leu Asp Trp Ala Pro Asp Gly His Phe Leu He Leu Ala Lys Ala' Asp 

2375 2380 2385 

TTG AAG TTA CTT TGC ATG AAG CCA GGG GAT GCT CCA TCT GAA ATC TGG 7370 
Leu Lys Leu Leu Cys Het Lys Pro Gly Asp Ala Pro Ser Glu He Trp 
2390 2395 2400 2405 

AGC AGC TAT ACA GAA AAT CCT ATG ATA TTG TCC ACC CAC AAG GAA TAT 7418 
Ser Ser Tyr Thr Glu Asn Pro Met lie Leu Ser Thr His Lys Glu Tyr 

2410 2415 2420 

GGC ATA TTT GTC CTG CAG CCC AAG GAT CCT GGA GTT CTT TCT TTC TTG 7466 
Gly He Phe Yal Leu Gin Pro Lys Asp Pro Gly Val Leu Ser Phe Leu 

2425 2430 2435 

AGG CAA AAG GAA TCA GGA AAG TTT GAA GAG AGG CTG AAC TTT GAT ATA 7514 
Arg Gin Lys Glu Ser Gly Lys Phe Glu Glu Arg Leu Asn Phe Asp He 

2440 2445 2450 

AAC TTA GAG AAT CCT ACT AGG ACC CTA ATA TCG ATA ACT CAA GCC AAA 7562 
Asn Leu Glu Asn Pro Ser Arg Thr Leu He Ser He Thr Gin Ala Lys 

2455 2460 2465 

CCT GAA TCT GAG TCC TCA TTT TTG TGT GCC AGC TCT GAT GGG ATG CTA 7610 
Pro Glu Ser Glu Ser Ser Phe Leu Cys Ala Ser Ser Asp Gly Me.t Leu 
2470 2475 2480 2485 

TGG AAC CTG GCC AAA TGC AGC CCA GAA GGA GAA TGG ACC ACA GGT AAC 7658 
Trp Asn Leu Ala Lys Cys Ser Pro Glu Gly Glu Trp Thr Thr Gly Asn 

2490 2495 2500 

ATG TGG CAG AAA AAA GCA AAC ACT CCA GAA ACC CAA ACT CCA GGG ACA 7706 
Het Trp Gin Lys Lys Ala Asn Thr Pro Glu Thr Gin Thr Pro Gly Thr 

2505 2510 2515 

GAC CCA TCT ACC TGC AGG GAA TCT GAT GCC AGC ATG GAT ACT GAT GCC 7754 
Asp Pro Ser Thr Cys Arg Glu Ser Asp Ala Ser Het Asp Ser Asp Ala 
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2520 2525 2530 

ACC ATG CAT ACT GAG CCA ACA CCA CAT CTA AAG ACA CGG CAG CGT AGA 7802 
Ser Met Asp Ser Glu Pro Thr Pro His Leu Lys Thr Arg Gin Arg Arg 

2535 2540 2545 

AAG ATT CAC TCG GGC TCT GTC ACA GCC CTC CAT GTG CTA CCT CAG TTG 7850 
Lys He His Ser Gly Ser Val Thr Ala Leu His Val Leu Pro Glu Leu 
2550 2555 2560 2565 

CTG GTG ACA GCT TCG AAG GAC AGA GAT GTT AAG CTA TGG GAG AGA CCC 7898 
Leu Val Thr Ala Ser Lys Asp Arg Asp Val Lys Leu Trp Glu Arg Pro 

2570 2575 2580 

ACT ATG CAG CTG CTG GGC CTG TTC CGA TGC GAA GGG TCA GTG AGC TGC 7946 
Ser Met Gin Leu Leu Gly Leu Phe Arg Cys Glu Gly Ser Val Ser Cys 

2585 2590 2595 

CTG GAA CCT TGG CTG GGC GCT AAC TCC ACC CTG CAG CTT GCC GTG GGA 7994 
Leu Glu Pro Trp Leu Gly Ala Asn Ser Thr Leu Gin Leu Ala Val Gly 

2600 2605 2610 

GAC GTG CAG GGC AAT GTG TAC TTT CTG AAT TGG GAA TGAAGATGTG 8040 
Asp Val Gin Gly Asn Val Tyr Phe Leu Asn Trp Glu *** 

2615 2620 2625 

CCACTCGGGA ATAATGATAC CCCTTGTGCT AGAGATGCAA AGCCTGAAGA CACTGGTAGC 8100 
TTTTAATAAT TATAAAATTA ATAATTTCTT GATAATTATA AAAATGAAGT GTCAAAAAAT 8160 
CTCAAGTGTA GGCCTGCCTG TGTTCTCATG TGGATTTAGA ACAGGAGGAT ATTCTATGTG 8220 
TATGTATATG TACATTCTAA TGTGTGTCTC TTCTTATTCA ACATTAATCC TTACTAGAAC 8280 
CACAAGAAAG TGAATGAAAT CTTTAGTAGG TACTCTTTTG AAACTAGGTT TTAGAATTCT 8340 
TGCATCACTC GCGGGCCCTA GGACCCTAGG ATGCCATTCT TGCCAGGAGG AGGAATGAGA 8400 
GTGATGTTGG CCAACATTCA ATTTGAACAG AGCATGGAAG ACCTTTCAGT TCATCGGCAA 8460 
AGAATGAGGG AGGGAGAATA AGTCAGTCAT GCATCAGGGC ATTTAGAAAG AGCTATGTTT 8520 
CTGTCACAGA GACAGCCCTT TTCTCAGAAC TACCCAGAGG AGGCCGGCCA TGGTGGCTCA 8580 
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AGGTGGGCAG ATCACGAGGT CAGGAGATCA 8S40 
GTCTCTACTA AAAAATACAA AAAGTTGGCC 8700 
CTACTTGGGA GGCTGAGGCA GGAGAATGGC 8760 
CCGAGACACC ACTGCACTCC AGCCTGGGCA 8820 
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